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CAN EVERYTHING BE COMPUTED? - ON THE SOLVABILITY COMPLEXITY INDEX AND 

TOWERS OF ALGORITHMS 

J. BEN-ARTZI, A. C. HANSEN, O. NEVANLINNA, AND M. SEIDEL 


ABSTRACT. This paper addresses and establishes some of the fundamental barriers in the theory of computations 
and finally settles the long standing computational spectral problem. 

Due to the barriers presented in this paper, there are many problems, some of them at the heart of compu¬ 
tational theory, that do not fit into the classical frameworks of complexity theory. Hence, we are in need for a 
new extended theory of complexity, capable of handling these new issues. Such a theory is presented in this 
paper. Many computational problems can be solved as follows: a sequence of approximations is created by an 
algorithm, and the solution to the problem is the limit of this sequence (think about computing eigenvalues of a 
matrix for example). However, as we demonstrate, for several basic problems in computations (computing spec¬ 
tra of infinite dimensional operators, solutions to linear equations or roots of polynomials using rational maps) 
such a procedure based on one limit is impossible. Yet, one can compute solutions to these problems, but only 
by using several limits. This may come as a surprise, however, this touches onto the definite boundaries of com¬ 
putational mathematics. To analyze this phenomenon we use the Solvability Complexity Index (SCI). The SCI is 
the smallest number of limits needed in order to compute a desired quantity. In several cases (spectral problems, 
inverse problems) we provide sharp results on the SCI, thus we establish the absolute barriers for what can be 
achieved computationally. For example, we show that the SCI of spectra and essential spectra of infinite matrices 
is equal to three, and that the SCI of spectra of self-adjoint infinite matrices is equal to two, thus providing the 
lower bound barriers and the first algorithms to compute such spectra in two and three limits. This finally set¬ 
tles the long standing computational spectral problem. We also show that the SCI of solutions to infinite linear 
systems is two. 

Moreover, we establish barriers on error control. We prove that no algorithm can provide error control on 
the computational spectral problem or solutions to infinite-dimensional linear systems. In particular, one can 
get arbitrarily close to the solution, but never knowing when one is ’’epsilon” away. This is universal for all 
algorithms regardless of operations allowed. In addition, we provide bounds for the SCI of spectra of classes of 
Schrodinger operators, thus we affirmatively answer the long standing question on whether or not these spectra 
can actually be computed. Finally, we show how the SCI provides a natural framework for understanding barriers 
in computations. It has a direct link to the Arithmetical Hierarchy, and in particular, we demonstrate how the 
impossibility result of McMullen on polynomial root finding with rational maps in one limit, and the framework 
of Doyle and McMullen on solving the quintic in several limits, can be put in the SCI framework. 
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1. Introduction 

Let us start with the old and famous computational problem: can one compute the zeros of a polynomial by 
using only finitely many arithmetic operations and radicals of the polynomial coefficients? The answer is of 
course well known: yes, if the degree is less than five, no, if the degree is five or higher. However, what if we 
are also allowed to pass to a limit? In particular, can one construct a sequence of sets, where the construction 
of each element requires finitely many arithmetic operations and radicals, such that this sequence converges 
to the zeros of the polynomial? Indeed, the answer is yes, and this gives us a technique to compute zeros of 
polynomials (through a limiting process). Given the equivalence between roots of polynomials and spectra of 
matrices, this also provides a way of computing eigenvalues of matrices, given the matrix values. But what 
happens if the matrix becomes infinite (as an operator on / 2 (M)): can one then compute the spectrum by 
using finitely many arithmetic operations and radicals and then pass to the limit as in the finite-dimensional 
case? 

Consider another fundamental computational problem. Given a Schrodinger operator H = — A + V that 
is defined on some appropriate domain so that the spectrum is uniquely determined by the potential, can one 
compute the spectrum by using arithmetic operations and radicals on the point samples V ( x ) where x £ R“ 
and then take limits? Note that this problem differs from the matrix problem in that one can only access 
V(x) and not matrix elements. 

The former problem was only recently partially solved affirmatively in ll49l . However, the computation in 
this case is done with three limits, not one as is needed in the finite-dimensional case. This begs the question: 
is this a barrier that cannot be broken, does one really need several limits to do the computation? Given that 
the former problem was only recently partially solved, it is not a surprise that the latter problem is still open 
(although solutions exists in some special cases). It is an intriguing observation that it is more than eighty 
years since Schrodinger won his Nobel Prize in physics, yet how to compute spectra of arbitrary Schrodinger 
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operators is still unknown. The key question is therefore: can this be done with a finite number of limits (as 
in the matrix case), and if so what is the smallest number of limits needed? 

Let us consider another basic problem in computation. What if we want to compute a root of a polynomial, 
however we are not allowed to use radicals, but rather a rational map applied iteratively (such as Newton’s 
method). The problem with Newton’s method is that it may not converge. This problem prompted S. Smale 
|93l to ask whether there exists an alternative to Newton’s method, namely, a purely iterative generally 
convergent algorithm (see Section [8] for definition). Smale asked: “Is there any purely iterative generally 
convergent algorithm for polynomial zero finding ?” His conjecture was that the answer is ‘no’. This problem 
was settled by C. McMullen in l67l as follows: yes, if the degree is three; no, if the degree is higher (see 
also Il68ll95l ). However, in l(38l P. Doyle and C. McMullen demonstrated a striking phenomenon: this 
problem can be solved in the case of the quartic and the quintic using several limits. In particular, they provide 
a construction such that, by using several rational maps and independent limits, a root of the polynomial can 
be computed. 

It turns out that the examples above touch onto the absolute barriers in the theory of computations. In 
particular, many of these highly important problems cannot be solved by passing to a single limit, however, 
they can be solved by using several limits. This is a rather intriguing phenomenon and is important as 
more limits make the computations more complex. In this paper we provide a unifying framework for these 
kinds of questions by introducing a general concept of the Solvability Complexity Index (SCI) and towers 
of algorithms. This is done by generalizing the frameworks in |49| and (38). The SCI of a computational 
problem is the least amount of limits needed in order to compute the desired quantity, given a certain set of 
allowed operations. Many of the basic computational problems (from the insolvability of the quintic and the 
negative answer to Smale’s question, to questions on computing spectra of infinite matrices or Schrodinger 
operators, solutions to linear systems and even the Arithmetical Hierarchy) fit into this abstract framework. 
Moreover, we provide sharp bounds on the SCI for several of these problems and thus demonstrate solutions 
to - and barriers for - fundamental questions in computational mathematics. Some of the highlights are the 
following. 

Main results: 

(i) It is impossible to compute spectra and essential spectra of infinite matrices in less than three lim¬ 
its. This is universal for all algorithms regardless of the operations allowed (arithmetic operation, 
radicals etc). The only assumption on the algorithms is that they can only read a finite amount of 
information in each iteration step. This implies that even if there had existed an algorithm that could 
compute the spectrum of a finite dimensional matrix using finitely many arithmetic operations (of 
course no such algorithm exists), one could still not compute the spectrum of an infinite matrix 
in less than three limits. However, it is possible to compute spectra and essential spectra in three 
limits when allowing arithmetic operations of complex numbers. This finally settles the general 
computational spectral problem as considered in |49l . 

(ii) It is impossible to compute spectra of self-adjoint infinite matrices in less than two limits. This is, 
as above, universal and regardless of the operations allowed. However, it is possible to compute 
spectra of matrices with controlled growth on their resolvent in two limits (this then includes normal 
operators). It is possible to compute spectra of tridiagonal self-adjoint infinite matrices in one limit, 
but getting the essential spectrum is impossible, for that one needs two limits. 

(iii) One can compute spectra of all non-normal Schrodinger operators with bounded potential with 
bounded local total variation in two limits. If the operator is normal the number of limits is one (actu¬ 
ally we prove much more, only knowledge on the growth of the resolvent is needed). If the potential 
blows up at infinity, the spectrum can be computed in one limit, regardless of non-normality. This 
establishes an affirmative answer to the long standing problem of computing spectra of Schrodinger 
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operators for a vast class of potentials. Note that the techniques used here also pave the way for even 
more classes of potentials. 

(iv) It is impossible to compute the solution to a general infinite linear system in one limit (this is uni¬ 
versal for all algorithms regardless of the operations allowed), yet it is possible in two. The same 
goes for the problem of computing the norm of the inverse of an infinite matrix. For matrices with 
known/controllable off diagonal decay, one can compute the solution to a linear system in one limit. 

(v) Many problems that have SCI greater than one can never be computed with error control. In other 
words it is impossible to design an algorithm that can compute an approximation to the solution and 
know when one is ’’epsilon” away from the true solution. This is the case of many spectral problems 
and also the case for solutions to linear systems. Thus, in these cases, no-one can ever know with 
certainty, how close the computation is to the true solution. This is a universal statement for any 
possible algorithm. 

(vi) The SCI framework provides a new complexity theory for problems that do not fit into the existing 
complexity theories. In particular, current complexity theory cannot handle problems that requires 
several limits in the computation. We predict that this class of problems is vast, although this paper 
only exhibit a subset of this potentially big class. However, it already contains some of the core 
problems in computations such as spectral problems and inverse problems as well as polynomial 
root finding with rational maps. 

Note also that the SCI concept addresses the basic problem of computing with limits (Problem 5, 
p. 43, El) as posed by Blum, Shub and Smale. 

(vii) When considering decision problems there is a clear connection between the SCI and the Arithmeti¬ 
cal Hierarchy. In particular, the A m sets in the Arithmetical Hierarchy can equivalently be charac¬ 
terised in term of the SCI. Thus, one may view the SCI as a classification tool that is a generalisation 
of this complexity hierarchy to arbitrary computational problems. The link to the Arithmetical Hi¬ 
erarchy implies that the SCI can become arbitrarily large, i.e. for any k £ N there is a problem such 
that the SCI is equal to k. 

Remark 1.1 (Polynomial time algorithms and implementation). Note that all upper bounds on the SCI 

provided in this paper are proved constructively and hence yield actual implementable algorithms that can be 
used in practice. Moreover, all of them are indeed fast, meaning that the output produced, for each step of the 
iterations, is done in polynomial time in the number of input the algorithm reads. For actual implementations 
see lf5Qll . See also 14811491 . 

Remark 1.2 (Existing complexity theory vs. the Solvability Complexity Index). Note that, without 
going into details, classical complexity theory can essentially be summed up by the following two classes of 
problems: 

(I) The problem can be solved in finite time, and the task is to analyze how difficult this is. The 
complexity theory of these types of problem hosts the famous “P versus NP” problem that is one 
of the major open problems in mathematics today l99l . 

(II) The problem cannot be solved in finite time, however, it can be solved by computing a sequence 
of approximations, and the solution to the problem is the limit of the approximations. The task 
is to analyze the difficulty of carrying out the approximation procedure. The complexity theory of 
such problems contains famous questions, among them Smale’s 17th problem |[7ll23l . and a very rich 
mathematical theory HI 11I24I901I93I94I . The field of information based complexity theory I7ll74|96l 
is devoted specifically to these types of problems. 
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The key issue is that, as this paper establishes, there are fundamental barriers that prevent many problems at 
the heart of computational theory from fitting into the above frameworks. In particular, there is a third class 
of problems. 

(Ill) The problem cannot be computed by approximations and then passing to a limit, however, it can be 
computed by approximations and then passing to several limits. 

The theory of the Solvability Complexity Index offers a new view of complexity theory that bridges this 
gap. The theory is very flexible and does indeed incorporate the classical approaches. In particular, the class 
I discussed above is included in the set of problems that have SCI equal to zero, and the class II is included 
in the set of problems with SCI equal to one. This new extended complexity theory does not have to assume 
a machine (although it certainly can, as discussed in Section 17721 ) like the Turing machine 1981 or a Blum- 
Shub-Smale machine El The theory of the SCI and towers of algorithms only specify which mathematical 
operations are allowed. This allows essentially most computational problems into this framework ranging 
from analysis to recursion theory. 

Remark 1.3 (The SCI and a new classification theory). Note that although this paper establishes several 
classification results with the SCI, it leaves a vast set of open problems. In particular, one must now classify 
all problems according to their SCI. And it is of utmost importance to get an understanding of what kind of 
structure and extra information that allows one to lower the index. We predict that the class of problems with 
SCI greater than one is vast, see Section[9]for details. 

How to read the paper: For the reader only interested in the main results. Section [2] is the only prereq¬ 
uisite. The following sections after Section [2] present the main results from different parts of mathematics, 
however, these are completely self-contained. If the reader is interested in the proof techniques, an easy in¬ 
troduction is in the proofs of some of the basic decision problems. Thus, Section l7TTI is a good start followed 
by SectionfT4l 


2. The Solvability Complexity Index and towers of algorithms 

Throughout this paper we assume the following: 

(2. la) f2 is some set, called the primary set, 

(2. lb) A is a set of complex valued functions on f2, called the evaluation set, 

(2.1c) A4 is a metric space, 

(2. Id) E is a mapping 12 —> A4 , called the problem function. 

The set 12 is essentially the set of objects that give rise to our computational problems. It can be a family 
of matrices (infinite or finite), a collection of polynomials, a family of Schrodinger (or Dirac) operators with 
a certain potential etc. The problem function S : 12 —> Ai is what we are interested in computing. It could 
be the set of eigenvalues of an n x n matrix, the spectrum of a Hilbert (or Banach) space operator, root(s) of 
a polynomial etc. Finally, the set A is the collection of functions that provide us with the information we are 
allowed to read, say matrix elements, polynomial coefficients or pointwise values of a potential function of 
a Schrodinger operator, for example. 

In most cases it is convenient to consider a metric space Al, however, in the case of polynomials it may 
be more useful to use a pseudo metric space (see Example 12. II (IIP ). To explain this rather abstract setup in 
(12. Il l we commence with the following examples: 

Example 2.1. (I) (Spectral problems) Let f2 = the set of all bounded linear operators on a 

separable Hilbert space 'H, and the problem function E be the mapping A i-a sp(A) (the spectrum of 
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A). Here (M.. d) is the set of all compact subsets of C provided with the Hausdorff metric d = d\\ 
(defined precisely in (13. It ). The evaluation functions in A could for example consist of the family of 
all functions J : A i —> ( Aej , ef), i,j £ N, which provide the entries of the matrix representation 
of A w.r.t. an orthonormal basis { e I },; (= is|. Of course, fl could be a strict subset of B(TL), for example 
the set of self-adjoint or normal operators, and 3 could have represented the pseudo spectrum, the 
essential spectrum or any other interesting information about the operator. 

(II) (Inverse problems) Let fl = B\ m (Td.) x T~L, where ) denotes the set of all bounded invertible 

operators on 'H, and let the problem function E be the mapping (A, b) A ~ 1 b, which assigns to a 
linear problem Ax = b its solution x. The metric space AA would simply be 'H and A the collection 
of mappings {fi,j}ien,jeZ+ where f itj : ( A,b ) ( Aej,ei } for j £ N and f ifi : ( A,b ) i-A (b, ef). 

Also here fl could consist of operators with specific properties (off diagonal decay, self-adjointness, 
isometric properties). 

(III) (Polynomial root finding) Let fl = P s , the set of polynomials of degree < s over C and let the 
problem function S be the mapping p 4 {a £ C p(a) = 0} (the roots of p ). Let (AA,d) 
denote the collection of finite sets of points inC = CU{oo} equipped with the pseudo metric d : 
M x M -> [0, oo], defined by d(x, y) = mini <i<n,i<j<m \xj - y%\, where x = {xi,.. .,x n },y = 
{j/i, ■ ■ ■ i Dm} € AA. The reason for the pseudo metric is that the techniques of Doyle and McMullen 
that we will consider are based on computing a single root of a polynomial (as for example Newton’s 
method does). In this case A is the finite set of functions {/j}* =1 where fj : p i-> ay forp(f) = 

Efc=l a kt k . 

(IV) (Computational quantum mechanics) Let fl = L°°(M d ) fl C(M' / ) and let S : V sp(—A + V ), 

where the domain V(— A + V) = W 2,2 ^ 6 *) (the standard Sobolev space) and —A + V is the 
usual Schrodinger operator. Given that the spectra are unbounded, we cannot use the Hausdorff 
metric anymore, but will let (A4,dAw) denote the set of closed subsets of C equipped with the 
Attouch-Wets metric (see (14.21) ). In this case a natural choice of A would be the set of all evaluations 
f x : V V(x), x £ R d . 

(V) (Decision making) Let fl denote the set of infinite matrices with values in {0,1} and 3 : fl —> 
A4 = {Yes, No} where A4 is equipped with the discrete metric d x n sc . The evaluation functions 
would naturally be / ; ; j : A i—> Aij, i,j £ N, the (i, j)th matrix coordinate of A. A typical example 
of 5 could be: 3({Ajy}): Does { A t J } have a column containing infinitely many non-zero entries? 
Naturally, fl can be replaced with the natural numbers including zero Z + and 5 could be a question 
about membership in a certain set, as in classical recursion theory. In this case the evaluation set 
would be A = {A} consisting of the function A : Z + —> C, x x. 

Given this setup and motivation, we can now define what we mean by a computational problem. 

Definition 2.2 (Computational problem). Given a primary set fl, an evaluation set A, a (pseudo) metric 
space M. and a problem function 5 : fl —> A4 we call the collection {5, fl, M. , A} a computational problem. 

Our aim is to find and to study families of functions (that we will sometimes refer to as algorithms) 
which permit to approximate the function 5. The main pillar of our framework is the concept of a tower of 
algorithms. However, before that we will define a general algorithm. 

Definition 2.3 (General Algorithm). Given a computational problem {3, fl, A4, A}, a general algorithm 
is a mapping T : fl —> AA such that for each A £ fl: 

(i) there exists a finite subset of evaluations Ar(A) C A, 

(ii) the action of Y on A only depends on {Af} /gA r (A) where Af := f(A), 

(iii) for every B £ fl such that Bf = Af for every f £ Ar(A), it holds that Ap (B) = Ap(A). 
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We will sometimes write r({A/}^ e Ar(,4))> ;,i order to emphasize that T(A) only depends on the results 
{ Af } r (A) of finitely many evaluations. 

Note that for a general algorithm there are no restrictions on the operations allowed. The only restriction 
is that it can only take a finite amount of information, though it is allowed to adaptively choose the finite 
amount of information it reads depending on the input (which may very well be infinite, say an infinite 
matrix, or a function). The condition (iii) just ensures that the algorithm is well defined and consistent since, 
put in simple words, changing the input A shall not affect the algorithm’s action as long as the change does 
not affect the output of the relevant evaluations in Ar(-A). 

Definition 2.4 (Tower of algorithms). Given a computational problem {5, f2, Af, A}, a tower of algo¬ 
rithms of height k for {2, fl, Ai, A} is a family of sequences of functions 

T rafc : — ^ A4, 

Ifik-rik-i • ^ ~^ 

^ ^ Ai , 

where rife,..., ni £N and the functions T nfci . ni at the lowest level in the tower are general algorithms in 
the sense ofDefiniton \2.3\ Moreover, for every A £ f2, 

S(A) = lim r„ fc (A), 

rik—> oo 

= 1 ™ Tn k ,n k _AA), 

n.k- 1 — voo 

( 2 . 2 ) 

r nk,...,n 2 {A) = lim T nfc m(A), 

ni—foo 

where S = limn-^ S n means convergence S n —> S in the (pseudo) metric space Ai. 

In this paper we will discuss several types of towers: Doyle-McMullen towers, Kleene-Shoenfield towers. 
Arithmetic towers. Radical towers and General towers. A General tower will refer to the very general 
definition in Definition 12.41 specifying that there are no further restrictions as will be the case for the other 
towers. When we specify the type of tower, we specify requirements on the functions ... ,r rai 

in the hierarchy, in particular, what kind of operations may be allowed. Thus, a tower of algorithms for 
a computational problem is essentially the toolbox allowed. The Doyle-McMullen tower appeared first in 
the paper of Doyle and McMullen l38l (but then only referred to as a tower of algorithms). The Kleene- 
Shoenfield towers describe the Arithmetical Hierarchy known from Classical Recursion Theory as we will 
see in Section [7] A Radical tower, as defined below, first appeared in |[49| where it was referred to as a “set 
of estimating functions” for computing spectra. The definition here is substantially more general and allows 
for the use of these types of towers for a wide range of problems. 

Given the definition of a tower of algorithms we can now define the main concept of this paper: the 
Solvability Complexity Index (SCI). The SCI was first discussed in ||49l for a specific spectral problem, 
however, this definition extends to include general problems in computations. 

Definition 2.5 (Solvability Complexity Index). Given a computational problem {H, 11. A4, A}, it is said 
to have Solvability Complexity Index SCI(S, fl, Ai, A) a = k with respect to a tower of algorithms of type 
a if k is the smallest integer for which there exists a tower of algorithms of type a of height k. If no such 
tower exists then SCI(S, f2, Ad, A) a = oo. If there exists a tower {r„}„ g N of type a and height one such 
that 2 = T ni for some n i < oo, then we define SCI(2, Q,A4, A) a = 0. 
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The key goal with the SCI is that many problems can be put into this framework and analyzed. In 
particular, given a computational problem {H, G. AL A} and a tower of algorithms of a certain type, we are 
interested in the question: what is the smallest height of a tower possible? If it is finite, it means that the 
problem can be computed, if it is infinite it cannot be computed with such a tower. However, if it is finite, 
this number says something about how difficult it is to compute. Having several limits in the computation 
makes the computation more demanding. As we will see later in the paper many problems have SCI greater 
than one. Note that the SCI is a characteristic of a computational problem {£, Cl, Ad. A}. Thus, if any of 
the elements Cl, Ad, A are changed the SCI may also change. As we will see in many cases, the change of 
domain Cl changes the SCI, as may changes of Ad and A. 

We can now define an Arithmetic tower of algorithms and a Radical tower of algorithms (for the definition 
of a Doyle-McMullen tower of algorithms see Definition l8.1l in Section[8j for Kleene-Shoenfield see Defini- 
tion l7.8l in Section[7] and notice that both are special types of Arithmetic towers in these special settings). 

Definition 2.6 (Arithmetic and Radical towers). Given a computational problem {S, Cl, Ad, A} we define 
the following: 

(i) An Arithmetic tower of algorithms of height kfor {5, Cl, Ad, A} is a tower of algorithms where the 
lowest functions T = T„ fci ... ini : Cl —> Ad satisfy the following: For each A £ Cl the action of 
T on A consists of only performing finitely many arithmetic operations on {A/}j g A r (A) where we 
remind that Af = f(A). 

(ii) A Radical tower of algorithms of height k for {£, Cl, Ad, A} is a tower of algorithms where the 

lowest functions T = : Cl —> Ad satisfy the following: For each A £ Cl the action of T 

on A consists of only performing finitely many arithmetic operations on and extracting radicals of 
{^/}/GA r (A)- 

Remark 2.7. To state the definition of Arithmetic towers in other words one may say that for the finitely 
many steps of the computation of the lowest functions T = r nfci ... )ni : Cl —> Ad only the four arithmetic 
operations +, —, ■, / within the smallest (algebraic) field which is generated by the input {A/}/ g A r (A) are 
allowed. When we use the word radical we mean that we can evaluate the mapping a i—>■ (/a for a > 0 and 
n £ N. Also, in both Arithmetic and Radical towers we implicitly assume that any complex number can be 
decomposed into a real and an imaginary part, and moreover we can determine whether a = b or a > b for 
all real numbers a, b which can occur during the computations. 

Remark 2.8. In this paper we will mostly be concerned with Arithmetic towers (as opposed to Radical 
towers used in |49| ). The reason is that most of the problems considered in this paper can be solved without 
resorting to radicals. It should be mentioned that in a practical setting a Radical tower may be preferable for 
stability reasons. This is beyond the scope of this paper and we refer to l50l for details. 

When considering the SCI of a computational problem {S, Cl, Ad, A} with respect to a tower of certain 
type we will use the following notation: 

SCI(S,fi, Ad, A) DM , SCI(H, f], AI, A) ks , 

SCI(3, Cl,M,A.)\, SCI(5,n,A/f,A) R , SCI(5, Cl, M, A) G 

to denote the SCI with respect to Doyle-McMullen, Kleene-Schoenfield, Arithmetic, Radical and General 
towers respectively. Note the inequalities: SCI(S, Cl, Ad, A)a > SCI(S, Cl, Ad, A)r > SCI(S, Cl, Ad, A)g 
that are obvious. Given the rather abstract setup, let us consider some simple and clarifying examples. 

Example 2.9. (I) (Spectra ofnxn matrices and roots of polynomials) Let Cl 1 = C 4x4 and Cl 2 = 

C 5x5 and let S : A i —> sp(A). Let also Ad and A be defined as in Example 12. II (If above. Then, 
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obviously SCI(S, fii, Ad, A)r = 0 as it is possible to express the eigenvalues of the matrix by using 
finitely many arithmetic operations and radicals of the matrix elements. However, the eigenvalues 
cannot be expressed without radicals, thus SCI(S, 12 1 , Ad, A)a > 0. The situation is different when 
considering 1L- In particular, because of the insolvability of the quintic using radicals we must 
have SCI(S, fl 2 , Af, A)r > 0. Moreover, showing the insolvability of the quintic is obviously 
equivalent to showing SCI(S, 122, Ad, A)r > 0. Is it then clear that SCI(5, 122, Ad, A)r = 1? 
Note that none of the standard approaches such as the QR-algorithm gives this result as they are not 
globally convergent. However, as a result of the developments in ll49l it is relatively easy to show 
that SCI(H,J2i,Ad, A) R = 1. 

(II) (Spectra of compact operators) Let f2 = /C(Z 2 (N)) C £>(Z 2 (N)) denote the set of compact oper¬ 
ators on Z 2 (N). We assume access to the matrix elements, so A is as above, as is AT To build 
a Radical tower of height two for {3, AT, A} consider the following. Define P m to be the 
projection onto spanjei,... ,e m } (where {e ra } n£ N is the canonical basis). For A £ fl, define 
Tm 5 n(A) = Y m n (^P rn AP rn \Yi an ^p 7n 'j'), and r m (A) = sp(T > mAT > m |R an ^p m ^), wheie is a 

Radical tower of algorithms such that T m ,n{B) —> sp (B) for every B £ S(Ran(P m )) and every 
to, as n —> oo. Note that this is possible by the example above with to x to matrices. In particu¬ 
lar, we have T m (A) = linin^oo r m ra (A) and 3(A) = linim^oo T m (A) lf39l Part II, XI.9 Lemma 
5], which yields the bound SCI(3, 12, Ad, A)r < 2. Actually, we will show in Theorem l3.7l that 
SCI(S, 12, Ad, A)a = 1. 

Example 2.10 (Assumptions on A). The set A decides what the algorithm can read, and depending on 
what A may contain, the SCI may change as the following example demonstrates: 

Given a cylinder of radius r > 0 and height 1, compute a cube, more precisely its edge length a, of 
the same volume. 

Thus, f2 = (0, oo) shall be the set of all such radii, Ad = (0, oo), A simply contains the evaluation / : r H r, 
and the problem function S : 12 —> Ad maps each radius r to the desired edge length a, respectively. For¬ 
mally, we have a = y'nr 2 , and we immediately see that SCI(3, 12, Ad, A)a > SCI(S, 12, Al, A)r > 1: 
otherwise there would exist an algorithm for the computation of a, only performing finitely many arithmetic 
operations on and extracting radicals of the radius r (and the interim results). This algorithm would par¬ 
ticularly work on the restricted problem (3, {1}. AT A). Consequently there would also be a finite Radical 
algorithm for the computation of a 3 = ir, contradicting the fact that tv is transcendental. 

Now one may assume that this one critical constant n is already available (from e.g. some oracle, some 
precomputation or somewhere else). Then there are two questions arising: How can we provide the Arith¬ 
metic (resp. Radical) algorithms with this additional tool? and what is the resulting SCI? 

For the former there is a very simple solution: Just supplement A by a further evaluation function, namely 
the constant function g : r i—x 7r which outputs n for every input r £ !!, By doing this one actually adapts the 
computational problem (5,12, Ad, A) to the new problem (3,12, Ad, A) with A = {/, g}\ “Given r compute 
a, under the preassumption that tv is already available”, where this additional knowledge comes into the 
algorithm just as additional input via the new evaluation function. 

Then, for the computation of a = \/nr 2 , only three multiplications of the evaluations r = f(r) and 
tv = g(r ) and one root are required, thus SCI(H, 12, Ad, A)r = 0. However, SCI(S, 12, Ad, A)a > 1 still 
holds since, if there was an Arithmetic finite algorithm for the computation of a from any r, and in particular 
from r = 1, this algorithm yields yfrv after finitely many arithmetic operations. Having this, one easily 
obtains an Arithmetic algorithm which computes a/yfrv = \Jtv r 2 / yfrv = r 2 / 3 from every input r > 0, a 
contradiction. 
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Remark 2.11 (Assumptions on A). Motivated by the example above, there are several settings that may 
be considered when analyzing the SCI, for example: 

(I) A contains all constant functions. 

(II) Let T be a general algorithm, A,B £ Q. and Ap(A) C Ap(A) denote the set of constant functions. 
Then Ap(A) = Ap(/J). In particular, the constant functions are the same for A and B. 

(Ill) A contains no constant functions. 

Note that, as Example |2.10l demonstrates. given a specific type of tower of algorithms, the SCI may change 
depending on the assumptions I, II or III. Also, a lower bound on the SCI with assumption I is stronger than a 
lower bound with assumption II, and similarly with II and III. For upper bounds, the direction is the opposite, 
an upper bound is stronger with assumption III than with assumption II and similarly with II and I. 

When considering general towers of algorithms this does not affect the outcome, as the next theorem 
establishes. In particular, neither adding constant functions nor canceling them will then change the SCI G . 

Theorem 2.12. Let (S, Q, M, A) be a computational problem, and let A be the union of A and all the 
constant functions on Lt. 

(1) For every general algorithm r : LI —> M which, for each A € LI, applies certain evaluation 
functions Ap(A) C A resp., there exists a general algorithm r : Ll —^ Ai with the same output 
r(A) = r(A) which only applies the evaluation functions from Ap(A) := Ap(A) n A, for each 
A £ O, resp. 

(2) SCI(H,n,M,A) G = SCI(S,n,M,A) G . 

Proof. Obviously, (2) is an immediate consequence of (1): Just replace each general algorithm in the tower 
by its substitute which is given by (1) and does not make use of any additional constant evaluation functions. 

For (1), let T : fl — > Ai be a general algorithm. Then Y( A) is determined by A r (A) for each A £ fi, 
respectively. We just have to show that the output T(A) can already be identified from Ap(A) for each A. 
Assume the contrary, i.e. there are A, B £ LI such that f(A) ^ f (f?) although Ap(A) = Ap(f?) and 
Af = By holds for all / £ Ap(A) = Ap (B). Then we even have Af = Bf for all / £ Ap(A) since the 
functions which additionally occur here are just constants. By condition (iii) in Definition 12.31 A^(B) and 
Ap ( A) must coincide which now easily yields that T cannot distiguish between A and B, a contradiction. □ 

We emphasize that by this theorem all the lower bounds on the height of general towers that are proved 
within this paper are invariant under choosing I, II, or III. 

Most of the upper bounds on the SCI are achieved with the more specific arithmetic towers even under 
assumption III (no constants needed). For the cases where constants are needed we use II, and in those 
cases the specific constant functions will be spelled out. This is the case for operators with controlled resol¬ 
vent, bounded dispersion and Schrodinger operators with bounded potential. Interestingly, for Schiodinger 
operators with potential that blow up at infinity we only need III. 

Remark 2.13 (Adaptivity vs. non-adaptivity). One could restrict (i) in Definition l2.3l and require that the 
evaluation sets Ap ( A ) are the same for all A £ fl. In this case obviousy (iii) in Definition l2.3l is superfluous. 
This would give us a non-adaptive tower of algorithms as opposed an adaptive tower of algorithms where 
we do not have this restriction. This subtlety is quite important. In particular, it could change the result of 
the SCI. Note that a bound SCI > 2 is stronger if the tower of algorithms is adaptive, however the result 
SCI < 2 gets stronger if the tower of algorithms is required to be non-adaptive. In this article we will always 
let the towers be adaptive. However, we do want to point out that all our upper bounds for the SCI in this 
paper are done with non-adaptive towers. Thus, the results that we obtain are actually slightly stronger than 
what the theorems read. 
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3. Main theorems on computing spectra 

Computing spectra of linear operators is a fundamental problem that has received an overwhelming 
amount of attention over the last decades I3ti5l fl5lfl6ll2()ll211l30l[32ll46ll47ll49ll64ll75ll82ll851l871l97ll . and 
we can only cite a small subset here. We consider the computational problem {H, A4, A} in Examnle l2.il 

(I) in Section[2]with the Hausdorff metric on A4 defined by 

(3.1) <7h(2l, Y) = max < sup inf d(x,y), sup inf d(x,y) > , 

{xeXV^ Y yGY* e- Y J 

where d(x, y) = \x — y \ is the usual Euclidean distance. We ask the basic question: 

(i) Given a bounded operator A on a separable Hilbert space H and suppose that we can access the 
matrix elements {(Aej, GHijgN, where {ejjjgN is some orthonormal basis, can one compute the 
spectrum of A (given a Radical tower or an Arithmetic tower), and what is the SCI? 

This long standing question was partially answered affirmatively in j49l . however the result provided was 
only the bound SCI(S, < 3 where 5(A) = sp(A) and the domain of 5 is Q. = 13(1-1) (we omit the last 
two variables Ad, A in the SCI when they are obvious). Thus, we were left with a series of open problems 
such as: 

(ii) Is the SCI for spectra of operators > 1 for Arithmetic towers, or even for General towers? 

(iii) If so, what is the SCI for spectra of operators in different classes (e.g. self-adjoint, normal, compact, 
operators with off-diagonal decay etc.)? 

(iv) The above questions can be repeated when replacing the spectrum with the pseudo spectrum (see 
Definition l3.4l i or the essential spectrum. 

In the following theorems we completely characterize the SCI of spectra and pseudo spectra of different 
classes of operators and towers. 

Remark 3.1. We can think of every separable Hilbert space with an orthonormal basis {6i}igN as the space 
( 2 (N) equipped with the canonical basis {ei}igpj in the sense of the isometrical isomorphism given by the 
mapping h,; i -Y ei,i £ N. Thus, we can restrict ourselves to this prototype Z 2 (N) of a separable Hilbert space 
and the evaluation of all operators w.r.t. the canonical basis in all what follows. In particular, the evaluation 
set A shall be the set of functions fij : A H > ( Aej,ei ), i,j £ N, for A £ B(l 2 (N)). We further introduce 
the orthogonal projections P n onto the subspaces spanned by {ei,..., e„}, respectively. 

3.1. Operators with controlled resolvents. As a start, let g : [0, oo) —> [0, oo) be a continuous function, 
vanishing only at x = 0 and tending to infinity as x —> oo, and suppose that we know g. We consider 
bounded operators that have the following property: 

(3.2) ||(A — z/) -1 || -1 > g(dist( 2 ,sp(A))) ineverypoint z £ C, 

where we use the convention ||i? _1 || _1 := 0 if B has no bounded inverse. Before we prove a result on the 
SCI of spectra of operators having a common bound of such a type, we commence with some basic remarks. 
Clearly, we can suppose that g(x) < x for all x, since dist(z, sp(A)) > || (A — z/) -1 || -1 always holds. We 
further recall that self-adjoint operators and normal operators share this property with g(x) := x. Moreover, 
notice that for every operator A there always exists such a g (define g(a) := min{||(A — z/) _1 || _1 : z £ 
C with dist( 2 , sp(A)) = a}, taking continuity and compactness into account) although there is no g which 
works for all A. We will address this fact later again. 

Remark 3.2 (Assumptions on A). In order to make the “additional knowledge” g available for the algo¬ 
rithms we apply the approach of Remark 12.111 and assume that A contains, besides the usual evaluations 
fij : A i—^ (Aej, ei) ( i,j £ N) also the constant functions gij : A i —> g(i/j ) (i,j £ N), which provide the 
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values of g in all positive rational numbers. In the case of normal operators we do not need this information, 
and thus A does not need to contain any constant functions. 

Theorem 3.3 (Self-adjoint, normal and controlled resolvent). Given g as above, let fli denote the set of 

all bounded operators on ( 2 (N) with the property (13.2b . fp the set of all bounded normal operators and Q 3 
the set of all bounded self-adjoint operators. Consider 

zz : Dj 3 A 1 —)■ sp(.A) £ Af i = 1,2,3. 

Then, for each i = 1,2,3, 

(3.3) SCI(5,n i ) G = SCI(5,n i ) A = 2. 

Moreover, (13.3b holds ifVli is replaced by ilf 1 where Qf* := {A £ f \ : ||A|| < M}, where M > 0. 

The fact that the SCI of spectra of self-adjoint operators is equal to two may come as a surprise. In 
particular, even when faced with a problem that depends continuously on the input, there is no tower that can 
compute the spectrum in one limit. In fact, since SCI(S, 0 ,)g = 2, adding more operations allowed in the 
tower will never solve this issue. In particular, even if one could compute the spectrum of a finite-dimensional 
matrix using finitely many arithmetic operations (of course no such algorithm exists), it is still impossible to 
compute spectra of self-adjoint operators in one limit. However, the good news that SCI(S, 11, )= 2, thus, 
the computation can be done in two limits when arithmetic operations are allowed. Note how this improves 
the results from Il49ll substantially. 

3.2. Non-normal operators. Another set that has received substantial attention is the pseudo spectrum. 
This set has been popular in spectral theory, analysis of pseudo differential operators and non-Hermitian 
quantum mechanics. Recently, the concept of pseudo spectra has been generalized to what one refers to as 
TV-pseudo spectra. 

Definition 3.4 (Pseudospectra). For N £ Z + and e > 0, the (TV, e)-pseudospectrum of a bounded linear 
operator A £ B{ji) on a Hilbert space Tj is defined as the set 

sp N ,e(A) := {z £ C : ||(A — zAp 2 ”|| 2 " > 1/e}. 

For N = 0 this is the ( classical) e-pseudospectrum 

sp e (A) := {z £ C : ||(A — zAp 1 1| > 1/e}. 

For more information on pseudospectra we refer to lf33ll46ll47H49l[82li86ll97l . Also recall that the sets 
spn,c(A) are continuous w.r.t. the parameter e > 0, and converge to sp(A) as e —> 0 for every A. Another 
class of operators of interest is the family of operators with controllable off-diagonal decay. Before we get 
into the details we must introduce the concept of dispersion. 

Definition 3.5 (Dispersion). We say that the dispersion of A £ B(J 2 (W) ) is bounded by the function 
f : N -A N if 

Df, m {A) := max{||(J- P /(m) )AP m ||, \\P m A(I - P /(m) )||} -3 0 as m ^ 00 . 

Note that for every operator A there is always a function / which is a bound for its dispersion since AP m , 
P m A are compact and (P„) converges strongly to the identity. But there is no function / which acts as 
a uniform bound for all operators. Nevertheless, there are important (sub)classes of operators having well 
known uniform bounds, which should be mentioned: 

(i) band operators with bandwidth less than d: f(k) = k + d. 
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(ii) band-dominated and weakly band-dominated operators: f(k) = 2k. For definitions and properties 
of band and band-dominated operators see Il63ll77l[83l . Weakly band-dominated operators can be 
found in l66l . 

(iii) Laurent/Toeplitz operators with piecewise continuous generating function: f(k) = k 2 (cf. ifTTI 
and lf56l Proposition 5.4]). 

(iv) Let T be a family of bounded operators with a common bound /. Then/, given by/(fc) = f(k) + k, 
is a common bound for all operators in the Banach algebra which is generated by T. 

Remark 3.6 (Assumptions on A). In the case when the dispersion of the operator is known, the values 
/(to) (to £ N) shall be available to the algorithms as constant evaluation functions (as was done in Remark 
12.1 H and also for the function g above). However, if the dispersion is not known then A will not contain any 
constant functions in the theorems below. 

Theorem 3.7 (General, compact and dispersion operators). Define the following primary sets: fli = 
B(l 2 (N)), for f : N —> N let fl 2 denote the set of bounded operators on ( 2 (N) whose dispersion is bounded 
by f and fl 3 = /C(( 2 (N)), where /C(( 2 (N)) denotes the set of all compact operators. Define the following 
problem functions: Si (A) = sp(A) and for e > 0 and N £ Z + , let S 2 (A) = spN, e (A). Then 

(3.4) SCI(S 1 ,n 1 ) G = SCI(S 1) n 1 )A = 3, SCI(S 2 ,L! 1 ) g = SCI(S 2 ,G 1 )a = 2, 

(3.5) SCI(S 1 ,n 2 ) G = SCI(3 1) n 2 )A = 2, SCI(3 2 ,Q 2 ) g = SCI(S 2 ,0 2 )a = 1, 

(3.6) sci(Ei, n 3 ) G = scitSi, n 3 ) A = 1 , SCI(S 2 , n 3 ) G = sci(s 2 , n 3 ) A = 1 . 

Moreover, (13.4b . (13.5b and (13.6b hold ifLli is replaced by flf 1 where flf 1 = {A £ flj : ||A|| < M}, where 
M > 0. 

By Theorems 13.31 and 13.71 we observe that the SCI is equal to three as long as there is no additional 
information on the structure of the operators under consideration known and can be taken into account. 
Once a function g is available which estimates the behaviour of the resolvent norm in the sense of (13.2b . the 
SCI decreases to two . The same holds true, if a bound / on the dispersion is known. 

3.3. Computing the essential spectrum. We will end this section with the intriguing result that in the 
case of operators with (/-bounded resolvent norm and /-bounded dispersion, we have that the SCI for the 
spectrum is indeed one, yet the SCI for the essential spectrum is two. This may come as a surprise, however, 
what this result tells us is that the problem of distinguishing between the essential spectrum and the isolated 
eigenvalues with finite multiplicity cannot be solved by an algorithm using only one limit. In other words: 
one can compute the spectrum in one limit, but one will never be able to distinguish the essential spectrum 
from the total spectrum. This is summed up in the following theorem. 

Theorem 3.8 (Spectrum vs. essential spectrum). For A £ Gi = B(I 2 (N')') define Si(A) = sp(,4) and 
S 2 (A) = sp ess (A) (the essential spectrum). Let f : N —> N, g : [0, 00) —> [0, 00) and let fl 2 denote the set of 
bounded operators with the property m (or self-adjoint or normal operators) on ( 2 (N) whose dispersion 
is bounded by f. Then 

sci(H lj n 2 ) G = sci(Si,fi 2 ) A = i ! 

(3.7) SCI(S 2 , G 2 ) g = SCI(S 2 , n 2 )A = 2 , 

SCI(S 2 ,G 1 ) g = SCI(S 2 ,L! 1 )a = 3. 

Moreover, all the results in 17.7b hold iffli is replaced by f if* where Llf 1 = {A £ f f : ||A|| < M}, where 
M > 0. 
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Having considered bounded operators in this section we now turn to arguably one of the most important 
unbounded operators in the last decades: the Schrodinger operator. 

4. Main theorems on computational quantum mechanics 
T he Schrodinger operator 

(4.1) H = -A + V, 

is one of the most celebrated operators of modern times and the mainstay in quantum mechanics, and it is 
its spectrum that is of utmost importance. If we fix the domain of H such that it is appropriate for a class of 
potentials V, the spectrum of H is uniquely determined by V. The basic question is therefore: 

Given the potential V, so that V can be evaluated at any point x £ W 1 , can one compute the spectrum 
(or pseudospectrum) of H, and what is the SCI? 

Note that this problem has been unsolved for a long time when considering H acting on L 2 (R d ) allow¬ 
ing non self-adjointness and arbitrary complex potentials. This is not a surprise given that the problem of 
computing spectra of arbitrary non self-adjoint infinite matrices has only recently been solved (49). How¬ 
ever, there is a vast amount of work on how to compute spectra of Schrodinger operators with specific real 
potentials il0lll8l[l9ll27ll291l36ll65ll73ll . We emphasise the importance of generality as we want the theory to 
include non-Hermitian quantum mechanics Il8l9l5lll52l and the theory of resonances H92I101H . The key issue 
is that the only available input is the potential function V (no matrix values are assumed). In this section we 
shall solve this problem for very general classes of potentials and give bounds on the SCI. We consider the 
computational problem {3, f], M , A} in Example l2. 1 I f IV) in Section[2]with the Attouch-Wets metric defined 
by 

1, sup \d(x, A) — d(x, B)\ > , 

\x\<i J 

where A and B are closed subsets of C, and where d(x. A ) is the usual Euclidean distance between the point 
x £ C and A, which is well-defined even when A is unbounded. We discuss some properties of this metric 
in Remark II 1.21 Also, since the pseudospectrum may be discontinuous w.r.t e in the case of unbounded 
operators, it is more convenient to redefine it for the unbounded operator H to be 

sp e (tf) := cl({* £ C : \\(H - zl)~ l \\ > 1/e}). 

This is however equivalent to the definition in the bounded case in Definition l3.4l fsee e.g. 1141 1421186) ). 

4.1. Bounded potentials. We will first consider 

Hi := {V : V £ L°°(IR d ) D BV 0 (R d )}, 
where </>: [0, oo) —> [0, oo) is some increasing function and 

(4.3) BV 0 (R d ) = {/ : TV(/ [ _ a , a]d ) < </(«)}, 

(f[-a,a] d means / restricted to the box [—a, a] d ) with TV being the total variation of a function in the sense 
of Hardy and Krause (see Co)). 

Also, we consider Schrodinger operators with controlled resolvents. In particular, let g : [0, oc) —> [0,oo) 
be a continuous function, vanishing only at x = 0 and tending to infinity as x —> oo, and suppose that we 
know g. Define 

(4.4) fl 2 = fli fl fi = {V: Wi-A + V-zI)- 1 ^ 1 > 5 (dist(^, sp(-A + V)))}. 

Note that the set requires a little bit more than V just being locally of bounded variation. We also need 
to know an upper bound on the growth of the total variation as we restrict the function to a larger set. In 


(4.2) 


d\w(A, B ) = ^ 2 
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particular we need to know 6 or an estimate for it. The set fl of operators obviously includes self-adjoint and 
normal operators, however, is much bigger. 

Remark 4.1 (Assumptions on A). As done in the case of bounded Hilbert space operators, and as dis¬ 
cussed in Remark l2.1 II the additional knowledge of g is available for the algorithms by assuming that A also 
contains the constant functions g,_j : V g(i/j ) ( i,j £ N), which provide the values of g in all positive 
rational numbers. In addition we will assume that A contains certain constant functions that will be specified 
in Section ITl.ll in Remark fl 1 .61 . 

Theorem 4.2 (Bounded potential). Let T>(H) = W 2,2 (R d ) and define Si(U) = sp(—A + V) and, for 
e > 0, let 32(V) = sp e (—A + V). Then 

f < 2 fl = Hi 

SCItSi.nW” , SCI(3 2 ,H 1 )a<2. 

1 = 1 fi = fi 2 

Remark 4.3. As will be evident from the proof techniques, one can build towers of algorithms for operators 
with more general classes of potentials (for example L°°(IR d ) fl BVi oc (R d )) or L 2 (R d ) fl BVi oc (R d )), 
however, the height of these towers will be higher than the ones considered in this paper. The main future 
task is to obtain exact values of the SCI of the spectrum given the different potential classes. 

4.2. Unbounded potentials. We get a rather intriguing phenomenon for sectorial operators. Namely, the 
SCI of both the spectrum and the pseudospectrum is one. In particular, suppose that we have nonnegative 
9i, 02 such that 0\ + 9 2 < 7t. Define 

(4.5) fl = {V £ C(R d ) : Vxarg(V(x)) £ [— 9 2 ,9i], |V(x)| —> oo as x —> oo}. 

We define the operator H via the minimal operator h as: H = h**, h = — A + V, T>(h) = C£°(R d ). When 
V £ LI it follows that H has compact resolvent, a result that we also establish as a part of the proof of the 
following theorem. 

Remark 4.4 (Assumptions on A). Interestingly, no constant function are needed in A in order to obtain 
the results in the following theorem, as opposed to the case where we have a bounded potential. 

Theorem 4.5 (Unbounded potential). Let Si : Cl 9 V i->- sp(// ) and, for e > 0, H 2 : SI 3 k 4 sp e (H). 
Then 

SCIfHi, fi) G = SCI(Hi, fi) A = SCI(S 2 , Q) g = SCI(S 2 , fl) A = 1. 

Note that the key to this result is the compact resolvent of H. The fact that the SCI is one in this case 
is most natural as the SCI for spectra and pseudo spectra of compact operators is one. The continuity 
assumption on V in Theorem l4.5l is to make sure that the discretization used actually converges. However, 
by tweaking with the approximation this assumption can may be weakened to include potentials that have 
certain discontinuities. 


5. Main theorems on solving linear systems 

Just as finding spectra of operators and roots of polynomials, the problem of solving linear systems of 
equations is at the heart of computational mathematics. For the finite-dimensional case it is easy to find an 
algorithm that can perform the task, but what about the infinite-dimensional case? In particular, if b £ l 2 (N), 
A £ Hi nv (7 2 (N)) (the set of bounded invertible operators) and fl C Si nv (( 2 (N)) x ( 2 (N) and we define the 
mapping 3 : f l 5 ( A, b) A ~ 1 b. what is the SCI of this mapping given different domains fl? 
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Remark 5.1 (Assumptions on A). Here, as in Example 12.11 we again suppose that the set A of evalu¬ 
ations consists of the functions which read the matrix elements {(Aej, G)}jjeN and the sequence entries 
{(&, efe}}fe £ N of ( A, b) £ SI. Also, in the case when the dispersion of the operator is known, the values /(m) 
(to £ N) shall be available to the algorithms as constant evaluation functions (as was done in Remark [2.1 II) . 
However, if the dispersion is not known then A will not contain any constant functions in the theorems below. 

Theorem 5.2 (Linear systems). Let B lnv j(l 2 (N)) denote the set of bounded invertible operators with 
dispersion bounded by f : N —> N, £>i nViSa (( 2 (N)) denote the set ofbounded invertible self-adjoint operators, 
and define the domains Qi = Si nv (( 2 (N)) x ( 2 (N), fl 2 = Bi nVtSa (l 2 (N)) x ( 2 (N) andO .3 = B lnv j(l 2 (N)) x 
( 2 (N). Then 

(5.1) SCT(5,n 1 ) G = SCI(H } n 1 ) A = 2 J 

(5.2) SCI(S,H 2 ) g =SCI(5,H 2 )a = 2, 

(5.3) SCI(S, fi 3 ) G = SCI(3, S1 3 ) A = 1. 

Another problem of interest when dealing with solutions of linear systems of equations is the computation 
of the norm of the inverse. This is obviously related to the stability of the problem. The task of computing 
the norm of the inverse of an operator can also be analysed in terms of the SCI, and that is the topic of the 
next theorem. 

Theorem 5.3 (Computing norm of the inverse). Let Hi = B(J 2 (Nj), 0 2 the subset of self-adjoint oper¬ 
ators, H3 the subset of operators with dispersion bounded by an f : N —> N, and let B : i H ||A -1 ||.Q 


Then 


(5.4) 

SCI(3,H 1 ) g =SCI(H,H 1 ) a = 2, 

(5.5) 

SCI(3, H 2 ) g = SCI(E, H 2 ) a = 2, 

(5.6) 

SCI(3, H 3 ) g = SCI(3, H 3 )a = 1. 


6. Main theorems on the impossibility of error control 

One of the natural desires in computations is error control. In particular, given a computational prob¬ 
lem {£, H,AI,A} with SCI(S, fl,Af, A) Q = k for some tower of algorithms of type a, and a tower of 
algorithms of height k, Y rik ■ ■ ■ ■ ■ I’ n ,, it is highly desirable to be able to control the convergence 
T nk —► 3, ... —> r nfcj ..., n2 . More precisely, for e > 0, how big do rik, ■ ■ ■ ,n\ have to be 

such that 

d(T nkt ..., ni (A),Z(A))<e, VAeH. 

This type of global error control is quite common in different areas of computational theory such as in 
differential equations and integration 63. 

Unfortunately, such choices of nk, ■ ■ ■, n\ may be impossible. More precisely, problems with SCI greater 
than one with respect to a General tower will never have error control. This is summed up in the following 
theorem. 

Theorem 6.1 (No global error control). Given a computational problem {5, H, A4, A} where we have 
SCI(S, H, M, A)g > 2 . Suppose that there is a general tower of algorithms of height k, r„ fc ,..., r rafc 
for {5, H, A4 , A}. Then there do NOT exist integers nk = rife (to), ..., ni = n\(m) (depending on m) such 
that 

d(r nj>I ..., ni (A),S(A))<-, Vigil, VmgN. 


^As usual, || A 1 


:= 00 if A is not invertible. 
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With such a negative result, it is natural to consider a weaker concept than global error control such as 
local error control. In particular one could ask if the following is true: 

VA £ and Ve > 0, 3 rik, ■ ■ ■, n\ such that (A), 3(A)) < e. 

Indeed, it is, and note that the existence of rik ,... ,ni is guaranteed by the definition of a Tower of al¬ 
gorithms. If we could find the integers njt,... ,n\ we would call this local error control. However, the 
integers 74 ....., ri \ cannot be computed as the next theorem demonstrates, and thus local error control is 
also impossible. 

Theorem 6.2 (Local error control cannot be computed). Given a computational problem {5, (2, M ., A} 

with SCI(S, f2, Ai, A)q > 2, suppose that there is a general tower of algorithms T nfc ,..., T nfc of 
height kfor {5, fl, A4, A}. Then, there does NOT exist a sequence {T n } of general algorithms T n : 
such that for any A £ f2, 

Remark 6.3. The conclusion of Theorem 16.II and Theorem l6.2l is the following. Given a computational 
problem {3, Q.A4, A} with SCI(E, Q. JA, A)q > 2, then no one can ever know when one is ’’epsilon” away 
from the solution of the problem. In particular, if we have that SCI(S, f2, Ai , A)g = SCI(3, ft, M ., A)a = 
k > 2, one can compute an approximation to the solution, but never know when to stop. 

A weaker requirement than error control would be that one could reindex the tower to get only one 
limit. In particular, let l' nk , ■ ■ ■, be a General tower of algorithms for a computational problem 

{3, 12, A4, A} where the cardinality of A is infinite. By (12.21) . for any A £ fi, there exists a reindexing 
function : N —> N k such that 


r h A (n) k ,...,h A {n)i f n ~* °°- 

However, the function Iia is also impossible to compute as the following theorem establishes. 

Theorem 6.4 (Impossibility of computing the reindexing function). Given a computational problem 
{5, ST, 7VT, A} with SCI(S, Q, M., A)g > 2, suppose that there is a general tower T„ fc ,..., of 

algorithms of height kfor {S, 12, A4 , A}. Then, there does NOT exist a sequence {T n } of general algorithms 
f n : fl —> N fe such that for any A £ Q, 

r f n (A) fc ,...,f„(A) 1 (^) ^ a (^)i n ->• 00. 

Remark 6.5. Note that the result above sparks the discussion about which problems can actually have error 
control. 

(i) (Polynomial root finding with rational maps) As we will see in Section [8] the SCI of finding 
a root of the quintic given a Doyle-McMullen tower is greater than one. However, this does not 
imply that one cannot have error control. In particular, a potential reindexing of the functions in 
the tower, due to error control, could be possible. The new reindexed tower, however, will not be a 
Doyle-McMullen tower. Thus, the reindexing does not violate the bound of the SCI w.r.t. Doyle- 
McMullen towers. The situation is very different in the case of inverse problems. 

(ii) (Inverse Problems) As we discussed in Section [5] the SCI for solving arbitrary linear systems with 
a General tower is two. This is a much stronger statement when it comes to limitations of error 
control than in the polynomial case. In particular, since this holds for an arbitrary General tower, one 
cannot provide error control regardless of what kind of mathematical tools are allowed. However, 
the interesting problem will be to determine which subclasses of problems can be solved with error 
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control. These problems must have SCI(H, fi, A4, A)g < 1, however, this is of course only a 
necessary but not sufficient criterion. 

7. Computability, Arithmetical Hierarchy, mathematical logic and the SCI 

In this section we build a bridge to the classical theory of computability and mathematical logic, and 
demonstrate how the SCI and towers of algorithms extend important concepts in that field. Decision making 
problems are at the heart of the theory of computation and as a motivation we start with some basic problems. 
Intriguingly, it is through this framework we can prove some of the lower bounds on the SCI for spectral 
problems. 

7.1. Decision making. Within this section we exclusively deal with problems (functions) 

S : n M := {Yes,No}, 

where JH is equipped with the discrete metric. This means that for such problems we search for General 
algorithms T„ fc ... rai : H — > Ai which, for a given input w £ f2, answer Yes or No. We will refer to such 
problems as decision making problems. Clearly, a sequence {roj C Ai of such “answers” converges to 
to £ Ad if and only if finitely many m,; are different from to. The number of decision problems is countless, 
and we will only consider some basic illustrative problems here. 

Let Hi denote the collection of all sequences {cu}i e N with entries ai £ {0,1}. For {ai}j £ N £ Hi we 
define E({a,}) to be the answer to the following question, 

Si({ai}): Does {cu}i e N have a non-zero entry? 

For such problems the evaluation set A shall consist of the functions j), : {a,} i—>• a k , k G N, which read 
the fcth entry of a given function respectively. It is easy to see that SCI(Si, Hi)q = 1 in this case, 

however, we could ask the slightly more difficult question 

^({ai}): Does {cu} have infinitely many non-zero entries? 

And of course, we could extend these questions to matrices. In particular, let f?2 denote the collection of all 
infinite matrices with entries a^j G {0,1}, and consider the following problems, 

Does {aij} have a non-zero entry? 

S4({ai,j}) : Does {aij} have infinitely many non-zero entries? 

The situation becomes more complex and less clear with the following questions: 

E5({ct,;,j }) : Does {aij} have a column containing infinitely many non-zero entries? 

So ({a,.,}): Does {aij} have infinitely many columns containing infinitely many non-zero entries? 
S7({ai,j}) : Does {aij} have (only) finitely many columns with (only) finitely many Is? 

And finally the following question that intriguingly is linked to the problem of showing the lower bound 
on SCI for spectra of operators. In this case we consider infinite matrices indexed by Z rather than N. In 
particular, let H3 denote the collection of all infinite matrices with entries aij £ {0,1}. We then 

ask the following question. 


is : Wij}ijez j 3 DMj (|\/i ^ a k j < D\ V ('iR3i^a k j > i?A ^ a k j > R\ J J 

\ \ \ fc— i / \ /c—0 k= i / / / 

(“there is a bound D such that every column has either less than D Is or is two-sided infinite”) 


The purpose of this section is to first demonstrate how decision problems, that are the core of classical 
recursion theory, fit naturally into the SCI framework. And also provide key results that are crucial for 
proving lower bounds on the SCI for other problems. The following theorem sums up the bounds for the SCI 
for the first four of these problems. 



SCI 


19 


Theorem 7.1 (Decision making problems 1-4). Given the setup above we have 

sci(H 1 ,n 1 ) G = sci(s 1 ,n 1 ) A = i, 

SCI(S 2 , f) 1 ) G = SCI(3 2 , r2i) A = 2, 

sci(s 3 , d 2 )g = sci(s 3 , n 2 ) A = i, 
sci(s 4 , n 2 ) G = sci(s 4 , n 2 ) A = 2, 

Theorem 7.2 (Decision making problems 5-8). Given the setup above we have 

SCI(S 5 , H 2 ) G = SCI(S 5 ,0 2 ) A = 3, 
sci(s 7 , n 2 ) G = sci(h 7 , n 2 ) A = 3, 

SCI(S 8 ,n 3 )G = SCI(S 8 ,D 3 )A = 3, 

3 <SCI(S6, H 2 )g < SCI(S 6 ,0 2 )a < 4. 

We may also look at spectral theory from a decision making point of view and ask: How difficult is it 
to decide whether a given point A G C belongs to the spectrum of a given bounded linear operator A or 
not? Obviously this question is equivalent to asking whether 0 € sp(A — XI). Thus, it suffices to study the 
question for A = 0. To make this more precise, we consider the functions 

sp° : B{l 2 (N)) -+M:= {Yes, No}, 4^(0e sp(A)) 

The following theorem addresses this question. 

Theorem 7.3 (Decision making and spectra). Let ft C B(l 2 (N)) and define sp°(/ij. Let 

Hi = B(l 2 ( N)), / : N —> N and f) 2 denote the set of bounded operators on l 2 ( N) whose dispersion is 
bounded by f, and finally let fl 3 denote the set of bounded self-adjoint diagonal operators. Then 

sci(s,n 1 ) G = sci(s,n 1 )A = 3, 

SCI(3,0 2 ) G = SCI(3, 0 2 )a = 2, 

SCI(3,0 3 ) G = SCI(3,0 3 ) A = 2. 

This result reveals that even for self-adjoint diagonal operators there do not exist any height-one towers 
of algorithms that can decide sp°, although there is a height-one tower which computes the whole spectrum 
as is claimed in Theorem 13.81 This seems to be a bit surprising at a first glance. Actually, the present 
question is really stronger in a sense: From Theorem l3.8l we only get approximations for the spectrum which 
converge with respect to the Hausdorff distance, but which can still have even an empty intersection with 
sp(A), whereas Theorem l7.3l addresses the inclusion A £ sp(A). 

Note that the SCI of the decision problems above are considered with respect to general and arithmetic 
towers. These towers do not assume any computability model, but only a model on the mathematical tools 
allowed (arithmetic operations in the case of arithmetic tower) and the way the algorithm can read informa¬ 
tion (only finite amount of input). However, the SCI framework with towers of algorithms fit naturally into 
the classical theory of computability and the Arithmetical Hierarchy. 

7.2. Computable functions. Our considerations will be based on so-called recursive functions whose def¬ 
inition is due to Godel and can be found e.g. in ll72l 1.1.7], The precise definition is not really important 
for our following considerations since, due to a Basic Theorem ( GZJ Theorem 1.7.12]), it is equivalent to 
Turing computability, register machine computability, being finitely definable. Church’s A-definability, flow¬ 
chart computability, or many other well-established concepts of computable functions (see also 116011611 and 
a recent survey article on the topic 04l ). Church's Thesis (Church, Turing, 1936) proposes that 

Every effectively computable function is recursive. 
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Therefore, in all what follows we will refer to recursive functions simply as computable functions meaning 
one or all of the above precise concepts, or by Church’s Thesis any ’effectively computable function’. For a 
different model based on the Blum-Shub-Smale machine see m 

Remark 7.4. Functions R : Z™ —> {true, false} (or { Yes. No) in our previous notation) are called n-ary 
relations. Note that such relations can also be regarded as the characteristic functions of the respective sets 

{(zi, ...,r n )eZJ: R(xi,.. .,x n )}. 

7.3. The Arithmetical Hierarchy. A corner stone in classical logic is the notion of Arithmetical Hierarchy, 
which we will see later is strongly connected to the SCI and towers of algorithms. Before we can present our 
main contribution in this section, we need to recall some basic definitions and results. 

Definition 7.5 (Arithmetical Hierarchy). A relation R is in the Arithmetical Hierarchy if R is computable 
or if there exists a computable relation S such that R can be obtained from S by some finite sequence of 
complementation and/or projection operations. For this, the projection of f : Z™ —> {true,false} along the 
jth coordinate is defined as the characteristic function of the set (which is a subset ofT/fT 1 ) 

{(:ci, .. .,Xj-!,Xj +1 , ...,x„) : (3x j )f(x 1 , ... ,x n )}. 

The following equivalent conditions describe the Arithmetical Hierarchy (see 1721 IV.l] and lf79l 14.1-4]). 
In particular, let R be an n-ary relation, then the following are equivalent: 

(i) R is in the Arithmetical Hierarchy. 

(ii) R can be represented as 

(*i, ■ ■ • ,x n ) i-A- (Qiyi) ■ ■ ■ (Qm.ym)S{x i, ...,x n ,y i,.. .,y m ) 

where Qi is either (V) or (3) for i = 1,..., m, and S is an (n + m)-ary computable relation. 

(iii) R can be represented as 

(zi, ■ ..,x n )i-A (Q\yi) ■ ■ ■ {QkVk)T(x i,.. .,x n ,yi,...,y k ) 

where (Qi) is a list of alternated quantifiers ((V) and (3)), and T is an (n + fc)-ary computable 
relation (Prenex normal form, Kuratowsky, Tarski 1931). 

(iv) R is definable in First-Order Arithmetic (Godel, 1936). 

Following ll62l we define the classes of E„, n„ and A ra relations, proceeding by induction: 

Definition 7.6 (£ m , n m , A m ). Let m £ Z+. We then define the following. 

(i) A relation is So and Ho if it is computable. 

(ii) A relation is S m+ i if it can be expressed in the form (3 y)S(x, y), where S(x, y) is n m . 

(iii) A relation R is TL m +i if its complementary relation ~<R is E m +i. 

(iv) A m := S m n n m . 

It is easily seen that a relation R(x) is £ m iff it has a definition of the form 

(xi,... ,x n ) H- (3yi)(Vy 2 ) ■ ■ ■ ^(xi,... ,x„,yi,.. -,y m ), 

where S(x, y) is computable and there are m alternating quantifiers starting with 3. An analogous observa¬ 
tion holds for n ro relations, with in alternating quantifiers starting with V. This hierarchy is the Arithmetical 
Hierarchy, or Kleene-Mostowski Hierarchy 116011691 , and does not collapse. More precisely, we have the 
following |[72l IV.l.131: 

Theorem 7.7 (Hierarchy theorem). For any m £ N, we have the following: 


(i) S m \ Hm ^ 0. hence A m A £ m . 
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(ii) II777, \ ^ 0, hence Am ^ n m . 

(iii) U II rn ^ ^m+1- 

(iv) R e \Jnez + Zn if and only if R is in the Arithmetical Hierarchy. 

Note that this hierarchy is related to the classification based on the Turing jump operation by Post’s The¬ 
orem (e.g. (79l 14.5 Theorem VIII]). See also HI 001 regarding the Arithmetical Hierarchy of real numbers. 

7.4. The SCI and the Arithmetical Hierarchy. Given a subset A C Z + with characteristic function \A 
being definable in First-Order Arithmetic, we are interested in the SCI of deciding whether a given number 
x € Z + belongs to A or not. In other words, we want to determine the value of the characteristic function 
of A at the point x. Thus, we want to consider Towers of Algorithms for \A where the functions/relations at 
the lowest level shall be computable, and we again ask for the minimal height. More precisely, we consider 

• the primary set := Z + , 

• the evaluation set A = {A} consisting of the function A : Z + —> C, x 1 —> x, 

• the metric space M := ({true,false}, ddiscr ) = ({Yes,No}, ddiscr), 

where ddiscr denotes the discrete metric, and consider all functions H : CL —> Ai in the Arithmetical Hi¬ 
erarchy. In honour of Kleene and Shoenfield we call a Tower of Algorithms that is computable a Kleene- 
Shoenfield tower. 

Definition 7.8 (Kleene-Shoenfield tower). A tower of algorithms given by a family {r ni: ... n| : 11 A4 : 
n k ,... ,n\ 6 N} of functions at the lowest level is said to be a Kleene-Shoenfield tower, if the function 

x Cl M, (n k , ...,ni,x)t-A T nki ,,, tni (x) 

is computable. Given a computational problem {S, Cl, A4, A} as above, we will write SCI(S, Cl, A4, A)ks 
to denote the SCI with respect to a Kleene-Shoenfield tower. 

We can now present the main theorem, linking the SCI and the Arithmetical Hierarchy. 

Theorem 7.9 (The SCI and the Arithmetical Hierarchy). If H is A m+ i then there exists a Kleene- 
Shoenfield tower of algorithms of height m. Conversely, if SCI(S, H)ks = m then S is A m+ i, but not 


This theorem has an immediate corollary that shows how the SCI can become arbitrarily large. In partic¬ 
ular, for any k £ N there exists a problem that has SCI equal to k. 

Corollary 7.10 (The SCI can become arbitrarily large). For every k G N there exists a problem function 
S on f 1 with SCI(S, H)ks = k. 

Theorem 17 .91 follows immediately from a result by Shoenfield (Theorem 17. 1 ]} , which has roots in the 
papers of Gold l43l . Putnam l76il . Shoenfield (88], and was discussed in I128H811 . It is this result that builds 
the bridge between the SCI and the Arithmetical Hierarchy: 

Theorem 7.11 (Shoenfield 1959, (72 (IV.1.19)). For m £ N a function f : Z + —> {true,false} is A m+ i 
if and only if there is a computable function g : Z™ +1 —> {true,false} such that 

f(y)= lim lim g(y,x 1 ,... ,x m )- 

X 1 —^OO Xm—fOO 

Moreover, Corollary 17. 1 01 follows directly from Theorem l7.7l Next, we want to make the latter observa¬ 
tions more precise, and we want to demonstrate how to transform a given tower into prenex normal form on 
the one hand, and on the other hand, how to define a Kleene-Shoenfield tower of minimal height for a given 
A m+ i relation. Actually, we do not do that for the present setting and for Kleene-Shoenfield towers, since 
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the proof can be found in the literature, but for our initial more general setting of decision making problems 
in Section [7TTl 

7.5. Alternating quantifier forms for General Towers. We return to the more general setting of decision 
making problems of Section 17711 with arbitrary f2, a certain evaluation set A and A4 = {true,false} = 
{Yes,No}. Inspired by the observations on the Arithmetical Hierarchy, we make the following definition: 

Definition 7.12 (Alternating quantifier forms). Given the general setup above we define the following: 

(i) We say that S : fl —> M. permits a representation by an alternating quantifier form of length m if 

l —• — (Qm'tlm) ' * * (Q l?2l)rn m ,...,ni > 

where (Qf) is a list of alternating quantifiers (V) and (3), and all : 11 —> A4 are general 

algorithms in the sense of Definition \2.3\ 

(ii) We say that H is E m if an alternating quantifier form of length m exists with Q m being (3), and that 
H is n m if an alternating quantifier form of length m exists with Q m being (V). 

(iii) We say that H is A m ifE, is E m and H m . 

The following theorem demonstrates how the SCI framework can be viewed as a generalization of the 
Arithmetical Hierarchy to arbitrary computational problems. In particular, one can define a hierarchy for any 
kind of tower. Here we do this for a general tower, and obviously, this can be done for any tower. 

Theorem 7.13 (General Hierarchy). Following Defnition \7.12\ the following is true. 

(i) If SCI(E, fl)c < m then E is 

(ii) IfE is E m or then SCI(5, H)g < in. 

(iii) For to € N we have that SCI(S, Q) g = to if and only if m is the smallest number with 5 being 

A m+ i. 

We will call the hierarchy described above a General Hierarchy. 

8. Roots of polynomials and Doyle-McMullen towers 

In this section we recall the definition of a tower of algorithms from (38). We will name this type of tower 
a Doyle-McMullen tower and demonstrate how the results in If67l and Il38l can be put in a framework of the 
SCI. In particular, we will demonstrate how the construction of the Doyle-McMullen tower in ll38l can be 
viewed as a tower of algorithms defined in Definition l2.4l 

As mentioned in the introduction, one can compute zeros of a polynomial if one allows arithmetic opera¬ 
tions and radicals and can pass to a limit. However, what if one cannot use radicals, but rather iterations of a 
rational map? A natural choice of such a rational map would be Newton’s method. The only problem is that 
the iteration may not converge, and that motivated the question by Smale quoted in the introduction. 

As we now know from 1671 the answer is no, however, the results in l38l show that the quartic and 
the quintic can be solved with several rational maps and limits while this is not the case for higher degree 
polynomials. Below we first quote their results and then specify a particular tower of height three in the form 
that it can be viewed as a tower of algorithm in the sense of this paper. 

8.1. Doyle-McMullen towers. A purely iterative algorithm ll93l is a rational map[] 

T : Pd —A Rat m , p i-A T p 


z I.e. it’s a rational map of the coefficients of p. 
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which sends any polynomial p of degree < d to a rational function T p of a certain degree m. An important 
example of a purely iterative algorithm is Newton’s method. Furthermore, Doyle and McMullen call a purely 
iterative algorithm generally convergent if 

lim T™(z) exists for (p, z) in an open dense subset of x C. 

Here T p (z) denotes the nth iterate T p (z) = T p (T p ~ 1 {z)) of T p . For instance, Newton’s method is generally 
convergent only when d = 2. However, given a cubic polynomial p € P3 one can define an appropriate 
rational function q £ Rat3 whose roots coincide with the roots of p, and for which Newton’s method is 
generally convergent (see ED, Proposition 1.2). In l38l the authors provide a definition of a tower of 
algorithms, which we quote verbatim: 

Definition 8.1 (Doyle-McMullen tower). A tower of algorithms is a finite sequence of generally conver¬ 
gent algorithms, linked together serially, so the output of one or more can be used to compute the input to 
the next. The final output of the tower is a single number, computed rationally from the original input and 
the outputs of the intermediate generally convergent algorithms. 

Theorem 8.2 (McMullen l67l : Doyle and McMullen lf38ll ). For P,j there exists a generally convergent 
algorithm only for d < 3. Towers of algorithms exist additionally for d = 4 and d = 5 but not for d > 6. 

Note that, as shown in lf89ll . there are generally convergent algorithms if one in addition allows the oper¬ 
ation of complex conjugation. In the following we present how the Doyle-McMullen towers can be recast in 
the form of a general tower as defined in Definition l2.4l 

8.2. A height 3 tower for the quartic. In the following X, Y,... denote variables in the polynomials while 
x, y, ■ ■ • G C. We build the tower following the standard reduction path, see e.g. H35I . Given 

p(A) := X ^ -f- a\X 3 -P ct 2 X 2 -\- a 3 X -p 0,4 

one first transforms the equation by change of variable Y = X + ai/4 to arrive into the polynomial 

q(Y) := Y 4 + b 2 Y 2 + b 3 Y + b 4 , 

which one writes, with help of a parameter z, as q(Y) = (Y 2 + z) 2 — r(Y , z) where 

r(Y, z ) = (2z - b 2 )Y 2 - b 3 Y + z 2 - 64 . 

Here one wants a value of z such that r(Y, z) becomes a square which requires the discriminant to vanish: 
4(2z— b 2 )(z 2 — 64 ) — b 2 = 0. Viewing this as polynomial in Z, making a change of variable W = Z+(l/6)b 2 
and scaling the polynomial to monic we arrive at asking for a root of 

(8.1) s{W) := W 3 + c 2 W + c 3 . 

As all these are rational computations on the coefficients of p, we shall not express them explicitly. 

We denote by N{f, £0) the function in Newton’s iteration with initial value £o : 

&+i := where lV(/(£)) = £ - 

and further by Nj the mapping from initial data to the j th iterate Nj : (/, £0) ^ ■ We shall apply Newton’s 

iteration to the rational function f38l 

s{W) 


t(W ): 


3c 2 W 2 + 9 c 3 W - c \' 
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Thus Wj = Nj(t , wq) denotes the j th iterate Wj for a zero for s(w) = 0. This iteration converges in an open 
dense set of initial data. Denote w^ := lim^oo Wj. Now we change the variable Z = W — (1/6)62 and, 
denoting by Zj and z^ the corresponding values, we obtain r(Y. Zoo) as a square: 

r (Y, Zoo ) = (2 Zoo - b 2 ) (y - 2 ( 2 z^_ 62 )) ■ 

To find a zero of q(Y) we shall need to have a generally convergent iteration for \J f lz — 62 . Thus, we set 
Uj (V) := V 2 + 62 — 2 Zj and apply Newton’s method for this, starting with initial guess vq and iterating k 
times and set Vk,j := Nk(uj,v 0 ). From q(Y) = (Y 2 + Zoo) 2 — r(Y, z^) = 0 we move to solve one of the 
factors 

63 


Q (V) -Y 2 + Zoo V2Zoo b 2 (v 2(2 zj- b 2 )) 


= 0 . 


2 ( 2^00 - 6 2 ), 

However, we can do this only based on approximative values for the parameters, so we set 

63 


Qk ’ j (F) ~ Y * + Zj Vk ’ j ( Y 2(2 Zj - 62)) 


= 0 . 


Now apply Newton’s iteration to this, say n times, using starting value yo and denote the output by y n ,k,j- 

yn,k,j ~ Nn(Qk,j : Vo) • 

Finally, we set x n ,fcj = Vn.k.j — «i /4 in order to get an approximation to a root of p. Suppose now 


j = ni,k = n 2 , n = no. If ni —> 00 then w. 


n 1 


W o 


and hence z ni 


3 , too. It is natural to denote 


u(V) := V 2 + 6 2 — 2 Zoo and correspondingly v n2 := N n 2 {u,v 0 ) and 

63 


Qn 2 {Y) = Y 2 + Zoo - v„ 


Y — 


2(2zoo - 6 2 ) 


= 0 . 


0 n 3 ,n 2 — Nns(Qn 2 >yo) tix/4. Then we have lim ril _). 00 x n3 , n2)7ll 


u n 3 ,n 2 


.If 


= x n3 . Finally Xoo = lim. 


713—^00 *^773 


IS 


Then in an obvious manner x. 

we denote x n3 = N n3 (Q, y 0 ) - ax/4, then clearly lim ri2 _ i . 00 x n 3 t n 2 

a root of p. 

The link to the SCI. One special feature of these towers which are build on generally convergent algorithms 
is the following: in addition to the polynomial p, the initial values for the iterations have to be read into 
the process via evaluation functions. Denoting the initial values for the three different Newton’s iterations 
by do = (wo, vq, yo) G C 3 we can now put this Doyle-McMullen tower in the form of a general tower as 
defined in Definition l2.4l with the slight weakening that, for each p £ P 4 , the tower might converge only at 
a dense subset of initial values. In particular, set 

r n3 : P 4 x C 3 A C, by (p, d 0 ) H> x n3 , 

^713,7-12 : P 4 x C 3 —X Cby (p,d 0 ) i-X x n3 ,„ 2 , 

-^ 773 , 772,771 ■ P 4 x CT x Cby (p,do) 1 x x n3 ^ n2 ^ ni . 

Thus, if we let 12 = P 4 x C 3 and E. A4 be as in Example |2T| (III), and complement A by the mappings that 
read wo, Vo■ yo from the input, then by the construction above and Theorem l8.2l vve have that 

SCI(S, fl) DM £ {2,3}. 


8.3. A height 3 tower for the quintic. Let 

p{X) = X b + a\X 4 + a 2 X 3 + a 3 X 2 + a A X + a 5 

be the given quintic. Doyle and McMullen f38l give a generally convergent algorithm for the quintic in 
Brioschi form. Thus, one needs first to bring the general quintic to Brioschi form, then apply the iteration 
and finally construct at least one root for p(X). In the following we outline a path for doing this, which 
follows L. Kiepert [f58l except that the Brioschi quintic is solved by Doyle-McMullen iteration rather than 
by using Jacobi sextic. This path can be found in l59l . 
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One begins applying a Tschirnhaus transformation Y = X 2 — uX + v to arrive into principal form 

q(Y) = F 5 + b 3 Y 2 + b 4 Y + b 5 . 

Here v is obtained from a linear equation but to solve u one needs to solve a quadratic equation Q(U) = 
U 2 + aU + P, where the coefficients a,/3 are rational expressions of the coefficients of p(X), (see for 
example p. 100, eq. (6.2-9) in |59|). 

Here is the first application of Newton’s method. We are given an initial value uo and iterate j times 
Uj = Nj(Q, uq). We may assume that v is known exactly but we only have an approximation Uj to make 
the transformation. So, suppose the Newton iteration converges to u 0 0 . Thus, we make the transformation 
using Uj and force the coefficients b^j = bij = 0 while keep the others as they appear. The transformation 
being continuous yields polynomials 

Qj{Y ) = Y 5 + b 3 jY 2 + b 4 jY + b 3 j, 

whose roots shall converge to those of q(Y). The next step is to transform qj(Y ) into Brioschi form. Let the 
Brioschi form corresponding to the exact polynomial q{Y) be denoted by B(Z) 

(8.2) B{Z) = Z 5 - 10 CZ 3 + 45 C 2 Z - C 2 = 0, 


while with Bj{Z) we denote the exact Brioschi form corresponding to q :l ( Y). The transformation from q(Y) 
to B(Z) is of the form 


(8.3) 


A + pZ 

(Z 2 /C) — 3' 


Here A satisfies a quadratic equation with coefficients being polynomials of the coefficients in the principal 
form (p. 107, eq. (6.3-28) in Il59l ). Let us denote that quadratic by R{L) when it comes from q(Y ) and 
by Rj{L ) when it comes from qj(Y) respectively. Thus here we meet our second application of Newton’s 
method. So, we denote by 

A k,j ’■= Nk{Rj, Ao) 

the output of iterating k times for a solution of Rj{L) = 0. And, in a natural manner, we denote also 


\k=N k (R,\ 0 ) and A= lim N k (R,\ 0 ). 

k—> oo 

The corresponding values of p, k .j , Pk and p are then obtained by simple substitution (p. 107, eq. (6.3-30) 
in l59lB . The Tschirnhaus transformation with exact values (A, //) transforms the equation not yet to the 
Brioschi form with just one parameter C but such that the constant term may be different. However, the 
last step is just a simple scaling and then one is in the Brioschi form (18.21 ). However, when we apply the 
transformation with the approximated values (A k ,j, Pk,j) °r with (A*, p k ) we d° n °t arrive at the Brioschi 
form. So, w & force the coefficients of the fourth and second powers to vanish and replace the coefficients of 
the first power to match with the coefficients in the third power. Finally, after scaling the constant terms we 
have the Brioschi quintics B k j and B k , e.g. 

(8.4) B kJ (Z) =Z 5 - 10 C kJ Z 3 + 45C 2 jZ - C% } = 0. 


Provided that the Newton iterations converge, that is, the initial values (ito, Ao) are generic, these quintics 
converge to the exact one. 

Here we apply the generally convergent iteration by Doyle and McMullen 081 . They specify a rational 
function 


T c (Z) = z- 12 


9c(Z) 

g'c(z) 


where g is a polynomial of degree 6 in the variable C and of degree 12 in Z. Starting from an initial guess 
w 0 from an initial guess w n +i = Tc(Tc(w n )) to convergence and applying Tc still once, we obtain, after 
a finite rational computation with these two numbers, two roots of the Brioschi, say zi and zjj. If applied 
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to the approximative quintics and if the iteration is truncated after n steps, together with the corresponding 
postprocessing, we have obtained e.g. a pair (zi trii k,j, Zu,n,k,j)- 

What remains is to invert the Tschirnhaus transformations. Suppose z is a root of the exact Brioschi form 
( 18 . 2 b . Then the corresponding root of the principal quintic is obtained immediately from ( 18 . 3 b 

A + /iz 

ty= (z 2 /C) — 3 ’ 

Naturally, we can only apply this using approximated values for the parameters. Finally, one needs to 
transform the (approximative roots) of the principal quintic to (approximative) roots for the original general 
quintic p(X). This is done by a rational function X = r ( Y ) where r(Y ) is of second order in Y and the 
coefficients are polynomials of the coefficients if the original p(X) and u and v (p. 127 , eq. ( 6 . 8 - 3 ) in | | 59 l ). 
Again, we would be using only approximative values Uj in place of the exact u. In any case, at the end we 
obtain a pair of approximations to the roots of the original quintic. If we put m = j, ri-2 = k and ri :i = n, 
then this pair could be denoted by ( Xi }U3 )Tl2 iHl , xu : „ 3 , n2 , ni ). 

The link to the SCI. In the same way as with the quartic, we assume that the initial value do = (uq , A (l , u'o) € 
C 3 is generic, so that all iterations converge for large enough values and since the transformations are con¬ 
tinuous functions of the parameters in it, all necessary limits exist and match with each others. The functions 
rn 3 ,n2,ri,i can then be identified in a natural manner: 

Tn3 ■ IP 5 x C C , by (p, do) 1 ^ (*£7,773 3 *£77,773)? 

1^713,722 • ^5 X C ^ C by (p, do) 1 ^ ,772 : XjJ jU3 , n2 ); 

1^773,772,771 • ^5 ^ C ^ C by (p,C?o) 1 ^ (x 1 ,773,772 , 77 X J Xjj , n 3 ^ n 2 ? ti 1 ) , 

where (xi^ n 3 tn 2 ,xjj^ n3tn2 ) and (xi t n 3 ,xu in3 ) are the limits as ni —> oo and 77.2 —> 00 respectively. These 
limits exist for initial values in an open dense subset of C 3 . Hence, we let Q. = P5 x C 3 , and S, Ad, A be as in 
case of the quartic. Then, by the construction above and Theorem l 8 . 2 l we have, again in a slightly weakened 
sense, that 

SCI( 5 ,F!)dm e { 2 , 3 }. 

8 . 4 . Particular initial guesses and height one towers. The special feature of the above mentioned Doyle- 
McMullen towers is that they address the question whether one can achieve converge to the roots of a poly¬ 
nomial p for (almost) arbitrary initial guesses. With slight change of perspective one might also ask the 
question how big the SCI gets if one applies purely iterative algorithms after a suitable clever choice of 
initial values. And indeed, the answer to that question is really satisfactory: For polynomials of arbitrary 
degree one can compute the whole set of roots (more precisely: approximate it in the sense of the Hausdorff 
distance) by a tower of height one which just consists of Newtons method. 

The key tool for the choice of the initial values is the main theorem of El: 

Theorem 8.3 (Hubbard, Schleicher and Sutherland 11541 1. For every d > 2 there is a set Sd consisting 
of at most l.lld log 2 d points in C with the property that for every polynomial p of degree d and every root 
z ofp there is a point s £ Sd such that the sequence of Newton iterates := {-/V™ (s)} converges 

to z. In particular, the proof is constructive, and these sets Sd can easily be computed. 

A further important property of Newton’s method is that, in case of converge, the speed is at least linear: 
If z n := N™(s) tend to a root z ofp then there exists a constant c such that \z n — z\ < c/n. Finally we have 
the following. 

Proposition 8.4. Let p be a polynomial ofdegree d, e > 0 and z n := Np(s). If\z n — z ra +i| < ^ thenthere 
is a root z ofp with \z n — z\ < e. 
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Proof. We have = \z n -z n+ i\ < g, hence |p(z„)| < e|p ^" )l . Decomposers) = allf =1 (a;-a; i ), 

notice that p'(x) = aYhi=i^-i=i,tyj( x — Xi), choose j such that |n f =1>ijt j(z n — xf) | is maximal, and 
conclude that 


\atti=i{z n - Xi )| = \p{z n )\ < 


e \p'{ z n)\ 


— e l a IIj_^ (z n Xi) |, 


thus \z n — Xj\ < e. Now z = Xj is a root as asserted. 


□ 


Let p be a polynomial of degree d. For each s £ S d let s n denote the nth Newton iterates of s, and define 

1 


( 8 . 5 ) 


$n • S £ S d , |5 n Sn+lj <C 


y/n 


Then (r n (p)) converges to the set £(p) of all zeros of p in the Hausdorff metric. Indeed, let z be a zero of 
p. By Theorem l8.3l there is an initial value s £ Sd such that s n = Nf(s) tend to z with at least linear speed, 
i.e. 

2c 1 

|s„ - s„+i| < |s„ — z I + |s n+ i - z\ < — < —J= 

n y/n 

for all large n, hence s n £ F„(p) for all large n. Conversely, each s n £ F n (p) has the property that its 
distance to the set £(p) is less than e = -^L by Proposition [S3] 

Therefore we define Q d = I 1 ,/ to be the set of polynomials of degree d, A4 the set of finite subsets of 
C equipped with the Hausdorff metric, and 3 : fl d —> Ai be the mapping that sends p £ ( .!,/ to the set of 
its zeros. Further A,/ shall consist of the evaluation functions that read the coefficients of the polynomial 
p £ fl d , and the constant functions with the values s £ S d - Note again that these values can be effectively 
constructed. 


Theorem 8.5. Consider (3, fid. A4, A f /) as above. Then the algorithms (18.51) define an arithmetic tower 
of height one for the computation of the roots of each input polynomial p, thus SCI(5, fl d , Af , A^)a < 1. 
Moreover, this tower employs just Newton’s Method, i.e. a purely iterative algorithm. 

9 . Open Problems 

Establishing the SCI opens up for a long list of open problems. In particular, we now need to classify all 
types of computational problems in terms of the SCI. One could think of essentially four main categories of 
problems: 

(I) Problems with SCI equal to zero. This class contains most of the problems in classical complexity 
theory. 

(Ila) Problems with SCI equal to one, where one also has error control. This set includes integration 
problems, ODE’s, root finding of polynomials, etc., and is the core of information based complexity 
lf%l as well as parts of real number complexity theory ED. 

(Hb) Problems with SCI equal to one, but where there is no error control. This set includes for ex¬ 
ample problems of computing spectra of tridiagonal self-adjoint infinite matrices like real discrete 
Schrodinger operators. 

(Ill) Problems with SCI greater than one. This class includes the main problems discussed in this paper, 
and this is the class that needs most work. In particular, we need a full classification theory for all 
problems in this class determining the SCI. What is presented in this paper is just the beginning. We 
predict that this class is vast and suggestions to problems that may be in this class follow below. 

Remark 9.1 (Adding structure to reduce the SCI). Note that classifying subclasses of Class III according 
to the SCI is about determining what kind of extra structure and information needs to be added to reduce the 
SCI from k to k — 1 and /,: 2 and so forth. This means also that one has to invent new algorithms (or towers 

of algorithms) as the tower used to obtain SCI = k obviously cannot be used for problems with SCI = k — 1. 
Thus, classifying all subclasses of III will lead to vast numbers of new algorithms. 
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9.1. Potential problems with SCI greater than one. 

9.1.1. Quantum mechanics. It is indeed likely that some of the fundamental computational quantum me¬ 
chanics have SCI greater than one. We have already discussed upper bounds of the SCI for computing 
spectra of Schrodinger operators 

(9.1) H = - A + V, V:R d ^tC, 

and we believe that the SCI is equal to two for computing spectra of these operators when considering the 
setup in Section[4]where V is bounded and we are using an Arithmetic tower. 

Similarly, this may very well be the case for Dirac operators. In particular, consider the same computa¬ 
tional problem {H, fl, Ai, A} as in Section|4j however, with the Schrodinger operator replaced by the Dirac 
operator. More precisely, let % = ffi| =1 .L 2 (R 3 ) and define (formally) Pj on H by Pj = ©f =1 Pj, with 
Pj = — igf-, for j = 1, 2,3, where Pj is formally defined on L 2 (R 3 ). Let Hq = a jPj + A where ay 

and (3 are 4-by-4 matrices satisfying the commutation relation 

(Xj&k A tx\zOLj 2 Sjfcl, j , k — 1,2,3,4, ol\ — (3. 

Then it is well known that H 0 is self-adjoint on ©j^W^.^R 3 ) where Vk 2 ,i(R 3 ) = {/ £ L 2 (R 3 ) : Tf £ 
L 2 (K 3 )} and L 2 (R 3 ) = {/ £ L 2 ( R 3 ) : (1 + | • I 2 ) 1 / 2 / £ L 2 (R 3 )}. Let V £ L°°(R 3 ) and define the Dirac 
operator 

4 4 

H d = H 0 + 0 V, V{H) = 0 W 2 ,i (R 3 ). 

k= 1 k =1 

The question is: what is the SCI of computing spectra of such Dirac operators? Note that the similarity 
between the Dirac operator and the Schrodinger operator suggest that we may very well be able to use the 
techniques from this paper to get the upper bounds. 

9.1.2. Infinite dimensional optimization and inverse problems. Given an A £ £>(( 2 (N)) and y £ ( 2 (N) and 
the optimization problem of finding 

(9.2) x £ argmin ||r?||;p subject to \\Ap — y\\ < 5, S > 0, p&[ 1, oo), 

r)ElP( N) 

where we assume that this problem is feasible for the given 5, meaning that there exists at least one minimizer. 
Such problems are popular in sampling theory and compressed sensing imaEi- In this case we may 
consider the computational problem {5, f2, Ai, A} similar to what we have in Section [j] however, we would 
first consider the closed metric space {77 : \\Arj — y|| < <5} and let A4 be the metric quotient space when 
identifying all the minimizer. Also, S = [a;] where a; is a minimizer and [ai] denotes the equivalence class 
corresponding to x. Note that if Q = x where = {A : A = A*, A is compact, ||A|| < M} 
for some M > 0 and f ?2 = {y ■ V £ Ran(A)}, where Ran(A) denotes the range of A, then it can be 
shown ll50l that SCI(S, Q )g > 2. This suggests that even in the case where there is a substantial amount 
of extra structure, there are many infinite-dimensional optimization problems and inverse problems with 
SCI > 1 . 

Interestingly, if we instead consider the problem of finding 

x £ argmin||Ap-y|| 2 + A||77 ||;p, A > 0, p£[l,2], 

7jei p ( n) 

with the similar setup as above then it can be shown lf50l that SCI(S, fl)o = 1. Similarly, we may consider 
the problems of finding 

p£ argmin i / (Tu — f) 2 dx + ATV(m), 
ueBV(n) 2 Jfi 


(9.3) 


A > 0, 
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and 

(9.4) g G argmin TV(it) subject to / (Tu — /) 2 dx < 6, A > 0, 

ueBV(n) -tn 

where CL C K. d is some appropriate domain, T : L 2 (Cl) —> L 2 (M . d ) is some linear operator, / is some func¬ 
tion, possibly not even in the range of T, BV (f2) denotes the set of functions that have bounded variations 
and 

TV(u) = sup |y udivudx : ||i>||oo < 1 

where £7(12, R 2 ) is the set of continuously differentiable vector functions of compact support contained 
in CL. The optimization problem (19.31 ) is highly popular in imaging lf80l for example and a well known 
regularization technique in inverse problems. The fact that solving (19.21 ) has SCI greater than one indeed 
suggests that solving ( 19.41 ) even when T is compact also has SCI greater than one. 

These are just two popular examples, however, there are essentially countless number of infinite-dimensional 
optimization problems 00} of the same nature as the ones above, thus we predict that there are many of those 
problems with SCI > 1. 

9.1.3. Operator semigroups and PDEs in unbounded domains. Suppose that A is a closed linear operator 
which is bounded from below: (Ax, x) > /i||:r|| 2 for some /i e 1 and x G D(A), and one is to compute, 
for a fixed t > 0, e~ tA or e~ tA Xo for some xo in the Hilbert space. In the former case the task is to 
compute approximations which converge in the operator norm and in the latter in the norm of the underlying 
space. The semigroup generated by the Schrodinger operator — iH is naturally of particular interest: given 
H = — A + V compute 

(9.5) i\> G L 2 (R n ), t > 0. 

If we consider if) to be known, then solving (19.51 ) can be done by solving a PDE, namely the time dependent 
Schrodinger equation. However, as V is defined on the whole space, this PDE would not have any boundary. 

A typical method for computing solutions to PDEs in unbounded domains is to impose non-physical 
boundaries with some boundary conditions on them, so that the solution within is close (in some sense) 
to the correct solution of the original problem. This arises is wave-type equations, including the (time- 
dependent) Schrodinger equation. A notable result in this field is that of Engquist and Majda ED, where a 
method for imposing artificial boundary conditions that minimize non-physical reflections is introduced. 

However, given a type of PDEs with an unbounded domain, this still leaves the question of what is the 
associated SCI unanswered. More precisely, one could ask for computational solutions that converge to the 
true solution of a given problem on some finite time interval [0, t]. Is this possible in one limit? Note that 
the method of adding artificial boundary conditions, if such method is successful, would essentially have 
two limiting procedures. In particular, one limit for each boundary chosen, and one limit when the boundary 
tends to infinity. Therefore, it is not unreasonable to believe that the SCI > 1 for certain types of PDEs in 
unbounded domains. 

9.1.4. Computational harmonic analysis. A problem in computational harmonic analysis is to compute 
frame bounds |[26l for a given frame for L 2 (R d ). As this means computing extremal spectral values of a 
self-adjoint infinite matrix we are faced with a several limit problem. First, computing spectra of a self- 
adjoint operator has SCI = 2. Second, producing matrix elements from the frame elements / G L 2 (R d ) 
may indeed add an extra limit, unless this limit can be collapsed with one of the other limits. However, this 
question is completely open and computing frame bounds may have SCI > 1. 
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10 . Proofs of theorems in Section[ 3 ] 

We start the sections on the proofs of our main results with a simple but fundamental observation on the 
smallest singular values ai(B) of matrices B £ C mx ", which constitutes one of the corner stones for most 
of the general algorithms we will construct in the subsequent proofs. 

Proposition 10.1. Given a matrix B £ C rnx n and a number e > 0 one can test with finitely many arith¬ 
metic operations of the entries of B whether the smallest singular value (Ji(B) of B is greater than e. 

Proof The matrix B*B is self-adjoint and positive semidefinite, hence has its eigenvalues in [0, oo). The 
singular values of B are the square roots of these eigenvalues of B* B. The smallest singular value is greater 
than e iff the smallest eigenvalue of B*B is greater than e 2 , which is the case iff C := B*B — e 2 I is positive 
definite. It is well known that C is positive definite if and only if the pivots left after Gaussian elimination 
(without row exchange) are all positive. Thus, if C is positive definite, Gaussian elimination leads to pivots 
that are all positive, and this requires finitely many arithmetic operations. If C is not positive definite, then 
at some point a pivot is zero or negative, at this point the algorithm aborts. □ 

Remark 10.2. In practice it may be advisable to use the Cholesky decomposition when determining if C is 
positive definite or not. This is simply for stability purposes as discussed in l50l . However, introducing the 
Cholesky decomposition requires evaluation of a radical. As Proposition ! lO.ll is the mainstay in most Arith¬ 
metic towers constructed in this paper, one can simply replace the Gaussian elimination with the Cholesky 
decomposition and obtain a Radical tower instead that may be more appropriate for computations. 

Proof of Theorem \3.3\ Since flj 3 SI 2 3 ^3 it obviously suffices to show that SCI(S, Hi)a < 2 and 
SCI(5, ^ 3)0 > 2. We will start by showing that SCI(3, Hi)a < 2, thus we will show the existence of an 
Arithmetic tower of algorithms of height 2 for 3 = sp on the set Hi, where Hi is determined by g in (13.21 ). 
So let A £ Hi and moreover let 

7 (z) ■= niin{cr 1 (A — zl ), or(A* -zl)} = ||(A - z7) -1 || -1 , 

where (Ti{B) := inf{||.B£|| : £ £ l 2 (N), ||£|| = 1} is known as the lower norm, the injection modulus, or the 
smallest singular value of B £ B(l 2 { N)). To see why min{or {A — zI),a\{A* — zl)} = ||(A — z/) _1 || _1 
see for example (49]. 

Step I (The construction of the tower of algorithms): Let g : [0, 00 ) —> [0, 00 ) be as in the statement 
of Theorem l3.3l in particular, continuous, vanishing only at x = 0 and tending to 00 as x —> 00 . Note that 
without loss of generality we can also assume that g is strictly increasing and g(x) < x for all x. Then the 
inverse function h{y) := g~ 1 {y) : [0, 00 ) —>■ [0, 00 ) is well defined, continuous, strictly increasing, h(y) > y 
for every y, and lim y _>o h{y) = 0. 

Let K C C be a compact set and 6 > 0. We introduce a <5-grid for K by G S (K) := (K + Bg{ 0)) (T 
(<5(Z + iZ)), where Bg( 0) denotes the closed ball of radius S centered at 0. Without loss of generality we 
may assume that is an integer, and obviously, G S (K) is finite. Moreover, introduce hs(y) := min{M : 
k £ N ,g{kd) > y} and observe that for each y, evaluating hg(y) requires only finitely many evaluations of 
g. Also, notice that h(y) < hg{y) < h(y) + S. For a given function £ : C —> [0, 00 ) we define sets T^(() 
as follows: For each z £ G S (K) let I z := Bf l6 ((( z )){z) (T (<5(Z + iZ)). Further 

• ifC(*)< 1 then introduce the set M~ of all w £ I z for which ((w) < £(i>) holds for all v £ I z . 

• Otherwise, if £( 2 ) > 1, just set M z := 0. 

Now define 


T'k'(C) := 1J M - 

zeG 6 (K) 


( 10 . 1 ) 
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Notice that for the computation of T^-(£) only finitely many evaluations of ( and g are required. To define 
the lowest level of the tower by this construction we introduce appropriate functions 

Cm,n{z) ■= mm{k/m : k £ N, k/m > mm{a 1 (P n (A - zI)P m |R an (p m )), 

( 10 . 2 ) 

<ri(P n {A*-zI)P m | Ran(Pm) )}}. 


Then we define 

r m , n (A) = := T^ (0) (C m,n), 

(10.3) 

T m (A) := lim T^ /m 

n—>oo 

where we will show that the limit exists. To show that this provides an arithmetic tower of algorithms for E we 
firstly mention that each of the mappings A K > T m n (A) is a general algorithm in the sense of Definition 12. 3 1 
Moreover, as mentioned above, the computation of (( m ,n) requires only finitely many evaluations of 

(m t n, and the finite number of constants g(k/m ) < 1, k = 1,.... Hence it suffices to demonstrate that, for 
a single z £ C, the evaluation of Cm,n(z) requires finitely many arithmetic operations of the evaluations Af, 
f £ Ar m „ (A). This is done as follows. For k £ N, we start with k = 1, then: 

• Check whether min{cri(P„(A - z/)P m | Ran ( Pm )),<Ti(P„(A* - zI)P m \ Ran ( Pm ))} < k/m. 

• If not let k = k + 1 and repeat, otherwise ( m ,n(x) = k/m. 

Note that the first step requires finitely many arithmetic operations of the complex numbers Af, f £ 
Ap m n (A) by Proposition 110.11 and the loop will clearly terminate for a finite /<:. Secondly, we have to 
show the desired convergence as n —> oo and then as rn -£ oo. 

Step II: r m „(A) — > r m (A), and r m (A) — > Si(A). To prove Step II we need the following result. 
Claim: Let K be a compact set containing the spectrum of A and 0 < <5 < e < 1/2. Further assume that 
c is a function with IIC-llloo.p : = II (C - l)Xk\\°° < e on K := (K + P^( d iam(if)+2e)+e(0)), where Xk 
denotes the characteristic function of K. Finally, let 


(10.4) u(£) := max{/i(3£ + h(t + £) — h(t )) + £ : t £ [0,1]}. 


Then we have that 

s p(^)) < w( e ) anc * Jim it(£) = 0. 

To prove the claim, let z £ G 5 (K) and notice that I z C K since, for every v £ I z , 

\z — v\ < hs(((z)) < hs(f(z) + e) < h(dist(z , sp(A)) + e) + <5 

(10.5) 

< /i(diam(Ff) + S + e) + <5. 

Suppose that M z ^ 0. Note that by (13.2b . the monotonicity of h, and the compactness of sp(A) there is a 
y £ sp(A) of minimal distance to z with \z — y\ < h(j(z)). Since ||i/ — 7 ^ j. < ewe get \z — y\ < 
/i(£(z) + e). Hence, at least one of the v £ I z , let’s say vq, satisfies |r>o — y \ < h(</(z) + e) — (i(£(z)) + <5. 
Noting again that 7 ( 110 ) < dist(u 0 , sp(A)), we get ((vo) < 7 (u 0 ) + e < h(((z ) + e) — h(((z)) + 2e. By 
the definition of M z this estimate now holds for all points w £ M z and we conclude that, for all w £ M z , 

dist(w, sp(A)) = h(g(dist(ui, sp(A)))) < h(j(u>)) 

( 10 . 6 ) 

< h(((w) + e) < h(h(((z) + e) - h(((z)) + 3e). 

This observation holds for every z £ G S (K) and all w £ M z , hence all points in Y R (£) are closer to sp(A) 
than u(e). 

Conversely, take any y £ sp(A) C K. Then there is a point z £ G S (K) with \z — y\ < 5 < e, hence 
£(z) < 7 ( 2 ) + e < dist(z, sp(A)) + e < 2e < 1. Thus, M z is not empty and contains a point which is closer 
to y than h(((z))+e < h( 2e) + e < u(e). Finally notice that the mapping (t,£) >->• /i(/i(i+£) —/i(i) + 3£)+£ 
is continuous on the compact set [0,1] x [0,1], hence uniformly continuous. Moreover, for every fixed t it 
tends to 0 as £ —> 0 , thus we can conclude u(/) —> 0 , and we have proved the claim. 
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Our next goal is the definition of suitable approximating functions for 7 and to build the bridge between 
7 and Cm,n- Here we have to take into account the following aspects: (i) The functions shall approximate 7 
locally uniformly, (ii) There shall be a compact set which contains sp(A) such that all of these functions are 
greater than 1 outside that set. To do so, define the functions 

7 m{z) := min{cri((H - 2 /)P m | Ran(Pm) ), cti((A* -zI)P m | Ran ( Pm ))} 

7 m,ni z ) '■= niin{CTi(P n (H- z/)P m | Ran(Pm) ), <7i(P n (H* - z/)P m | Ran(Pm) )}. 

Now cri(P n (H - -z/)P m | Ran (p m )) = inf{||P„(A - zI)P m £\\ : £ € Ran(P m ), ||£|| = 1} and cri((A - 
zI)Pm\R an (p m )) = inf{||(A - zI)P m £|| : £ G Ran(P m ), ||£|| = 1}. Thus, since P m —>■ I strongly 
and Pm+i > P m , then 7 m —> 7 pointwise and monotonically from above, and by Dini’s Theorem the 
convergence is uniform on every compact set, in particular on the ball K := B mo (0), with an mo > 2|| A|| + 
4 . Also, 7 m,n -A 7 m pointwise monotonically from below as n —> 00, hence again by Dini’s Theorem it 
follows that the convergence is uniform on the ball K = B mo (0). Outside that ball we have, for n > m, by 
a Neumann argument 


7 m,n(z) = min{( 7 i(P n (A - zI)P n P m ),a 1 (P n (A* - zI)P n P m )} 

> min{a 1 (P rl (A- zI)P n ),ai(P n (A* -zI)P n )} 

= ||(P„(A-z/)P n )- 1 |r 1 = |^|||(P n -^- 1 P„AP „)- 1 ||- 1 > 2. 

We can now directly show Step II: For all n > m > mo, the points in the finite set \ K 

lead to function values of C, m ,n being larger than 1, hence r mj „(A) = Fix e G (0,1/2). Then 

there is an mi > m 0 with mi > 3/e such that H 7 - 7 m< e/3 on K := B h ^ diam ^ +2e)+e (0) for all 
m > mi. Moreover, for every m there is an ni{m) such that \\"f m — 7 m,n || 00 x < e /3 f° r n > niim). 
This yields 

II7 ~ Cm.nlloo x — Il7 7m || 00 x Il7m ~ 7m,ra || 00 x Il7m,ra — Cm^Hoo X 

( 10 . 7 ) 

< e/3 + e/3 + 1/m < e 

whenever m > mi and n > ni(m). Hence, by the above claim, it holds that dR(T m , n (A), sp(A)) < u(e) 
whenever m > mi and n > n\{m). Since this bound tends to zero as e —> 0, it is proved that 


lim limsupdH(F min (A),sp(A)) = 0 . 

m—> o° n—>oo 

To ensure that (r m n (A))„ g N already converges w.r.t. the Hausdorff distance for every fixed m we just 
mention that the sequence of functions { 7 m ,n}n is monotonic from below, hence {Cm,n} n is monotonic as 
well. Moreover, for every fixed m, these ( m ,n are effectively evaluated only in finitely many points (namely 
the points w G M z with 2 in G 1 / m (K)) and can take only finitely many values, where the bounds do not 
depend on n. Thus the sets F TOj „(yl) change only finitely many times as n grows. Consequently, there 
is an n 2 {m) such that all T myn (A) with n > n 2 (m) coincide. This provides the limiting set r m (A) = 
linpj^oo Tm,n(H) and hence we have shown that SCI(S, 17 i)a < 2. 

Step III: SCI(S, {> 2 . To prove that there is no General tower of height one for the self-adjoint 
case we argue by contradiction and suppose that there is a sequence { 17 ,} of general algorithms such that 
Ffc(A) —> sp(A), and in particular each Ap fe (A) is finite. Thus, for every A and every k there exists a finite 
number N(A, k) G N such that the evaluations from Ap fc (A) only take the matrix entries A i3 = (Aej, e*) 
with i,j < N(A , k) into account. We consider operators of the type 


( 10 . 8 ) 


r =1 


with {l r } C N and A n 


/I 

0 




G C nxn 


0 


\1 


1 / 
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Then sp(A n ) = {0, 2}, hence A is bounded, self-adjoint, and sp(A) = {0, 2} as well. In order to 
find a counterexample we simply construct an appropriate sequence {l r } C N by induction: For C := 
diagjl, 0, 0,0,.. .} one obviously has sp(C) = {0,1}. Choose fco : = 1 and h > N(C, ko). 

Now, suppose that l\ ,..., l n are already chosen. Then we obviously have that sp (A^ ® ■ ■ ■ ® Ai n ® C) = 
{ 0 , 1 , 2 }, hence there exists a k n such that 

r fc (A h ® • • • ® A ln ® c) n Si (l) ± 0 

for every k > k n , where B± (1) denotes the closed ball of radius 1 jn and centre 1. Now, choose 

(10.9) l n+1 > N(A h ®-‘-®Ai n ®C,k n )-h-l2-...-l„. 

By this construction, it follows that 

(10.10) r fen {®^ =1 Ai r )n Bx (i) = r fen (A h ®...®i in ®C)nBi(i)^0 Vne n. 

Indeed, since any evaluation function £ A just provides the (i, j)-th matrix element, it follows by (1 1 0.91) 
that for any evaluation functions fij £ Ap,. n (A^ ® ■ ■ • ® Ai n ® C) we have that that 

fij(Ai i ® • • • ® A k ® C) = /ij(®“ i A ir ). 

Thus, by assumption (iii) in the definition of a General algorithm (Definition l2.3b . we get that Ap fcri ( A ; 1 ® 

• • • ® Ai n ® C) = Ar fen (®j^LiA/ r ) which, by assumption (ii) in Definition 12.31 yields (IIP. 10b . So, from 
( 110 . 10 b . we see that the point 1 is contained in the partial limiting set of the sequence {rfc(®ji2. 1 A/ i .)}^ =1 
which approximates sp(A) = {0, 2}, a contradiction. Finally, we note that the assertions w.r.t. Plf 1 are now 
obvious. □ 


Remark 10.3. We note that in the case of self-adjoint bounded operators the spectrum sp( A) is real and the 
function g can be choosen as x K > x. Thus, in the definition of (£) it suffices to consider compact KcR, 
the real grid G S (K) := (K + [—<5, 5]) (T (8Z), and for all z £ G S (K) only the two points Z\/ 2 := z ± £(z) 
in I z . Also in the case of normal operators, where g : x >-> x does the job again, the construction simplifies. 
In particular, for a given function £ : C —► [0, oo) we may define sets T^-(£) as follows: For z £ G S (K) 
consider I z := {z + C,{z)e^ Sl : j = 0,1,..., [27T() _1 ] } and define T S K (() again as in (IIP, lb . The proof is 
then the same, up to some obvious adaptions. 


Proof of Tlieorem \3. 71 Step I: SCI(S2,fli) G > 2. Actually, we will even see that this holds already for the 
set of all bounded self-adjoint operators. The proof is just an appropriate adaption of the third step in the proof 
of Theorem 13. 3 1 Assume that there is a sequence {IA} of general algorithms such that IA (A) —>■ sp v f (A) 
for all A £ fli, and consider operators of the type (110.8b . For sufficiently small e the (N. e)-pseudospectrum 
is a certain neighbourhood of {0, 2} disjoint to Bi (1), independently of the choice of {Z r }. By exactly the 
same procedure as before one obtains again that 1 belongs to the partial limiting set of Tfc(A) for a certain 
A, hence a contradiction. 

Step II: SCI(Si,( 22 )g > 2. Recall that denotes the set of bounded operators on / 2 (N) whose dis¬ 
persion is bounded by /. Thus, to show the claim, it suffices to show that for any height one general tower 
of algorithms {T^j-ngN for Si there exists a weighted shift S, with (Su)i = 0 for all u £ Z°°(N) and 
Se n = a n e n+ i where a = {a n }neN € Z°°(N), such that r m (S) Si (S') when m —> oo. Obviously 
S £ fl2- To construct such an S we let 


= { 0 , 0 ,..., 0 , 1 , 0 , 0 ,..., 0 , 1 , 1 , 0 , 0 ,..., 0 , 1 , 1 , 1 , 0 ,...}, a h+1 ,a h+2 ,...,a h+j = 1 , 


for some sequence {Zj where lj + % > lj + 2 j that we will determine. Observe that regardless of the 
choice of {Zj we have that sp(S) = B i(0) (the closed disc centred at zero with radius one). Indeed, 
on the one hand ||S|| = 1, hence sp(S) C B i(0). On the other hand, one can define the elementary shift 
operator V : e n i —> e„+i, n £ N, and its left inverse V~ : e n +i i —> e n , n £ N, ei i —> 0. Then the shifted 
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copies (y~) lj SV lj converge strongly to the limit operator V whose spectrum sp(V) = B i(0) is necessarily 
contained in the essential spectrum of S (cf. 1771 or (63j). 

To construct S we will inductively choose {lj}jen with the help of another sequence {mj} ge z + that 
will also be chosen inductively. Before we start, define, for any A £ Sl 2 and m £ N, N(A, to) to be the 
smallest integer so that the evaluations from Ar m (A) only take the matrix entries Aij = ( Aej,et ) with 
i,j < N(A, m ) into account. Now let too = 1 and choose l\ > 7V(0, mi). Suppose that l \,..., l n and 
too, ... ,m n -1 are already chosen. Note that sp (P r S) = {0}, since P r S = P r SP r can be regarded as 
a r x r-triangular matrix with zero-diagonal. Thus, since by assumption {I'm}mf=n is a General tower of 
algorithms for Hi, there is an m n such that T m (Pi n+n+ iS) C B 1 (0), for all to > m n . Let 

(10.11) l n +i > N(Pi n+n+ iS,m n ) such that also Z „ +1 > l n + 2n. 

Then, it follows that r m „(S) = T m n {Pi n+1 S) = T mn (Pi rl+n+ iS). Indeed, since any evaluation func¬ 
tion fij £ A just provides the (*, j)-th matrix element, it follows by (110. 1 II) that for any evaluation func¬ 
tions fij £ A r mn (S) we have that (S) = fi,j(Pi n+1 S) = fij(Pi n + n +iS). Thus, by assumption 
(iii) in the definition of a General algorithm (Definition 12.3b . we get that Ar mra (S) = Ar m (P / n+1 S) = 
Ar m „ (Pl„+n- 1-1 S) which, by assumption (ii) in Definition [23] implies the assertion. Thus, by the choice of 
the sequences {lj}jen and { m j}je z+, it follows that T m „(5) = T mn (Pi n+n+ iS) C Bi( 0) for every n. 
Since sp(S) = f?i(0) we observe that T m (S) ->*> Hi (S'). 

Step III: SCI(Hi, G| )g > 3. This proof requires tools from the section on decision making and logic. 
The reader is encouraged to read Section[7]before embarking on the proof that can be found in Section fl4l 
Step TV: SCI(Hi, Gi)a < 3 and SCI(H 2 , Gi)a < 2 . Let A £ Gi = B(l 2 (N)), and e > 0. We introduce 
the following continuous functions 7 ^ : C —> R + , 7 ^ : C -7- R+ and 7 ^ n : C —> R+, 

-y N (z) := (minjar ((A - zl) 2 ") )CTl ({A* - 5 /) 2 ”)}) 2 

7 m( z ) : = (min {a, ((A-zI) 2 " P m ) ,07 ((A* - zl) 2 "P m ^ j) 

2~ n 

7 £,„(*) := (min {(71 ((P„(A - */)P n ) 2 W P m ) , cti ((P„(A* - zI)P n f N P m )J) 

where 07 (P) denotes the smallest singular value of B, and in the terms like < 7 i(P„PP m ) the operator 
P n BP m is regarded as element of *B(Ran(P m ), Ran(P„)). We define the desired approximations r m „(A) 
for spa i£ (A) by 

r m , n (A) := {z £ G m : 7 *„(z) < e}, 

where G m := (TO _1 (Z + iZ))nP m (0). Due to Proposition ! 10. H it is clear that the computation of r m n (A) 
requires only finitely many arithmetic operations on finitely many evaluations {(Ae g , e*) : i, j = 1,..., n} 
of A. Now, one can show that 

lim lim r m n (A) =spjv £ (A), 

ra—>■ 00 n—>oo 

which proves that SCI(H 2 , Gi)a < 2. Furthermore, since dn(sp N e (A), sp(A)) —> 0 as e —> 0, we arrive at 
an Arithmetic tower of algorithms of height 3 for Hi with the algorithms at the lowest level given by 

r fc , m> n(A) := {z £ G m : 7 Z,ni z ) ^ 1 / fc }- 

For the proofs and details we refer to ||49l . 

Step V: SCI(Hi, G 2 ) A < 2 and SCI(^. 2 , G 2 )a = 1. Let A be such that / is a bound for its dispersion, and 
e > 0. W.l.o.g. we can assume that f(n) > n for every n. Define the composition g := /o- • •o/of2 w copies 
of /. Besides the already defined functions 7 ^, 7 ^ and 7 ^ n we additionally introduce 4’m '■= 7^ g(m)’ i- e - 

2~n 

■= (minjcri ((P s(m) (A- zI)P g{m) ) 2N P m ^ ,a 1 ((P g(m) (A* - ^/)P g(m) ) 2 ”P m ) j) 
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and we define the desired approximations T m (A) for sp^y e (A) by 

r m (A) := {z£G m :^(z)<e}. 

Again, the computation of T m (A) requires only finitely many arithmetic operations on finitely many evalu¬ 
ations {(Aej,ei) : i,j = 1,... ,g(m)} of A. 

Obviously, there exists a compact set K C C such that 7m(“) > 2e and > 2e for all z £ C \K 

and all m. Further note that converges to 7 ^ uniformly on K. Indeed, since all z 1 —>• (P s ( m ) (A — 
zI)P g{m) f N P m and z 1 —> (A — zl) 2 ™ P m are operator-valued polynomials of the same degree whose co¬ 
efficients converge in the norm due to the choice of the function g, we can take into account that \cr\(B + 
C ) — o\(B)\ < He'll holds for arbitrary bounded operators B, C, and we arrive at the conclusion that 
\"/m( z ) ~~ I —> 0 as m —> 00 uniformly w.r.t. z £ K. In order to simplify the notation we choose a 

sequence (S m ) which converges monotonically to zero such that 

7m (z) +5 m >ipZ ( z ) > 7m ( z ) - dm for every m and every z £ K. 

Moreover, we point out that each of the functions z v-> ( z ) is continuous on the compact set K, hence 

even uniformly continuous, and we can assume without loss of generality that, for every m, 

(10.12) I tpm( z ) - *l>m(y)\ < dm for arbitrary z,y £ K, \z - y\ < 1/m. 

Now let A e (A) := {z £ C : 7 N (z) < e} as well as 

A e , m (A) := {z £ C : 7 %(z) < ej, ^ e , m (A) :={:£C: ip%(z) < e}. 

By the discussion above, we conclude for all m> k that 

(10.13) A e+Sk ,m{A) D A e+Sm ,m(A) D ^e,m(A) D A e _ Sm ,m( A ) A A e _^, m (A). 

Since, P m < P m +1 and P m —> I strongly, 7 // —y j N monotonically from above pointwise (and hence 
locally uniformly by Dini’s Theorem). Thus, by ll49l . A e+ s k , m {A) —»■ A £+( 5 fc (A) = sp Are+ 5 fc (A) and 
A e _a fc , m (A) -> A e _ 5 fc (A) = sp Nt£ _ Sk (A) asm ->■ 00 . Hence, since sp N ^ ±s fc (A) -7 sp 7 Ve (A) as k -7 00 , 
(110. 1 3b yields lintm-nx, T' e m (A) = sp^y e (A). To finish the proof we observe that it is clear that on the 
one hand *I/ e , m (A) D T m (A). On the other hand, for sufficiently large m it holds true that for every point 
x £ 'Fg— 5 m ,m (A) there is a point y x £ G m nU 1/rn (x) and, by ( 110.12b we get \ip%(y x ) - ip%(x)\ < S m that 
is y x even belongs to r m (A). Thus 

T m (A) +B 1/m (0) D ^ e - Sm , m (A) 

for sufficiently large m. Combining this, we arrive at 

v Fc,m(A) + B 1/k ( 0) D T m (A) + B 1/m ( 0) D v F e -,5 m ,m(A) D $ e _ 4j>iI „(A), 

for m > k large. By the above, the sets on the left tend to sp N e (A) + B i/fc(0) as m —> 00 , and the sets on 
the right converge to sp^y e _ Sk (A) for every k. Since both of these sets converge to sp^y e (A) as k —> 00 this 
provides lim m _>oo T m (A) = sp Ar e (A). 

The already mentioned fact that spjy e (A) —> sp(A) as e —»• 0 finishes the proof of this Step V. 

Step VI: We are left with the compact case. Since f(n) := n is a bound on the dispersion for every 
compact operator K, we can reuse the algorithms 

T^(iT) := | z£G n : (<n ({P n {K - zI)P n ) 2N ))' < ej 

from Step V to obtain an Arithmetic tower of height one for the pseudospectra sp v £ (K). Now we claim 
that 

T n {K) := r y /n (K) = {z£G n :a 1 ( P n {K - zI)P n ) < 1/n}, n £ N, 
do the job for the spectrum sp(iT), i.e. for every d > 0 there exists no £ N such that dn(sp(K), T n (K)) < S 
for all n > no- Fix S > 0 and choose e > 0 such that sp e (K) C sp (K) + P< 5 / 2 ( 0 ). Then, due to the above. 
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there exists m > 1/e such that T°n l ^ n {K) C T° ,e (A") C sp e (AT) + . 65 / 2 ( 0 ) C sp(A") + 65 ( 0 ) for all 
n > n\. Conversely, we know (see e.g. If39l Part II, XI.9 Lemma 5]) that sp (P n KP n ) —> sp(A'), i.e. there 
exists 77.2 > ni such that sp(AT) C sp (P n KP n ) + 65 / 2 ( 0 ). Choose no > n 2 such that no > 2||/v || + 1 and 
no > 2/5. Then, for all n> no and every z £ sp (P n KP n ) there exists z n £ G n with \z — z n \ < 1/n, hence 
cti ( P n (K — z n I)P n ) < 1/n, which implies that z n £ r°’ 1 ^"(Jf). Thus, sp(P n KP n ) C r°’ 1 ^ ra (AT) + 
6i/ n (0) C r°’ 1 ^"(iT) + 65 / 2 ( 0 ), and we conclude that sp(iT) C T°’ 1 ^ n (A') + 65 ( 0 ) for all n > no- □ 

Proof of Theorem \3.8\ Step I: SC1(E|. f^G = SCI(E|, 0 2 )a = 1. To see that, recall the algorithms (1 10.3b 
for the more general setting of Theorem l3.3l and plug in the bound / on the dispersion to obtain the general 
algorithms T m (A) := (£„,), where 

(m(z) := min{fc/m : k £ N, k/m > min{cri( 6 / ( m )(A - zI)P m ), < 7 i( 6 /( m )(A* - zI)P m )}} 

and is defined in (110. 11 1 . Then (110.7b becomes 


II 7 Cmlloo /f L II 7 7m|| 00j /f + Wlm 7m,/(m) lloo.A + ||7m,/(m) CmH^x 

< e /3 + e /3 + 1 /m < e 


for m larger than a certain m 2 > mi (cf. Step V in the proof of’Theorem l3.3| i. Again, by the claim in Step II 
of the proof of Theorem l3.3l we find dn(r m (A), sp(A)) < u(e) whenever m > m 2 . Thus, it is proved that 
linim^oo dH(T m (A), sp(A)) = 0 and the rest is obvious. 

Step II: SCI(S 2 , f^G > 2. To see that, let {o n } n£ R C {0,1} be a non-constant sequence and A = 
diag{«,} a diagonal operator on / 2 (N). Then the spectrum equals {0,1}. The essential spectrum contains 
the point 0 if and only if {a n }neN has infinitely many 0s. Thus, if SCI(S 2 , D 2 )g = 1, i.e. there is a General 
height-one tower of algorithms {I’/ ; }/.:«=[ [ which computes the essential spectrum of A, then we can define 
algorithms for a respective decision problem (e.g. S((o n }): Does {a„} ne N contain infinitely many 0s?) via 
ffe({a n }) = Yes iff dist(0, Tfc(A)) < 1 / 2 . Thus, SCI(S)g = 1 which contradicts Theorem l7.ll 

Step III: SCI( 52 ,1 1 1 )g > 3. This proof requires tools from the section on decision making and logic. 
The reader is encouraged to read Section [7]before embarking on the proof that can be found in Section fl4l 
Step IY: SCI(5 2 , )a ' 3. 3Ve start by defining the following functions on C, where (/,, .— I Pn , 


f^m,n,k '• Ai (*■ min{ai(Pfc(A XI)Q rn P n ') : tri(Pk(A A/) QmPn )} 
fi m ,n ■ A min{cr 1 ((A - XI)Q m P n ), a 1 ((A - \I)*Q m P n )} 

f-m ■ A min{cri((A - A I)Q m ), °i ((A - A I)*Q m )}. 


Here P k {A — XI)Q m P n is considered as operator on Ran (Q m P n ), etc. as usual. Recall from the previous 
proofs that, for every n, m, iJ<m,n,k —> fim,n pointwise and monotonically from below as k —> 00 and for 
every m n — > n m pointwise and monotonically from above as n —> 00. Furthermore, {p. m } m gN is 
pointwise increasing and bounded, hence converges as well. Next, we define the finite grids 



and, for A £ l?i, 



(10.14) 


r m,n(A) := P| r m „ k(A) = lim T m ritk (A) 

11 fc-> 00 


(10.15) 


r m (A) := M T m n (A) = lim T m n (A), 

V — 77 ,—kOO 


(10.16) 


T(A) := f) T m (A) = lim T m (A). 

m —koo 
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With Proposition ! 10. H it easily follows again that all „ are general algorithms in the sense of Definition 
12. 3 1 that require only finitely many arithmetic operations. Thus, in order to show that this provides indeed 
a tower of algorithms for S 2 we need to establish the limits in ( IIP. 141) . (110.15b and ( 110.16b and that r(A) 
equals sp ess (A). To do that we abbreviate H := Z 2 (N) and show that 

(10.17) /x(A) := lim // m (A) equals ||(A — XI + /C(?f )) -1 || -1 for all A £ C, 

m—> 00 

where A — XI + /C(7f) denotes the element in the Calkin algebra B(R) j K.(R) and where we use the con¬ 
vention ||h _1 || _1 := 0 if the element b is not invertible. Clearly it suffices to consider A = 0. The estimate 
“<” is trivial in case /r(0) = 0. So, Let /i(0) > e > 0. Choose m £ N such that fi m (0) > /i(0) — e. The 
operator A 0 := AQ m : Ran Q m —> Ran( AQ m ) is invertible, hence the kernel of A = AQ m + AP m has 
finite dimension. <Ji(A*Q m ) > 0 yields that Ran,4 has finite codimension, hence both A and AQ m are 
Fredholm. Let R be the orthogonal projection onto Ran AQ TO , /j 0 the inverse of Aq and B := BqR. Then 

BA- I = (BA- I)P m + (BA - I)Q m = (BA - I)P m and 
AB — I = (AB - /)(/ - R) + (AB - I)R = (AB - /)(/ - R) 
are compact, i.e. B is a regularizer for A. Now 

||(A + /C(7f))- 1 ||- 1 >|| J B ||- 1 = || J B 0 J R ||- 1 

> (IIRollPII)- 1 = ll^oir 1 = <Ti(AQ m ) > m - e 

gives the estimate “<” since e is arbitrary. 

Conversely, there is nothing to prove if A is not Fredholm, so let e > 0 and B £ (.4 + K.(R))^ [ be a 
regularizer with ||73|| < ||(zl + + e. Since the operator K := BA — I is compact we get for 

all sufficiently large m that \\Q m BAQ m — Q m \\ = \\Q m KQm\\ is so small such that Q m + Q m KQm is 
invertible in £>(Ran(Q m )), 

Qm(Qm T QmKQm) Qm.B AQ m = Qm and || Qm.B B\ | 5 6. 

^ . -' 

=: B ± £ B(U) 

We get that <ri(AQ m ) > 0, hence the compression AQ m : Ran(Q m ) —> Ran (AQ m ) is invertible and 
the compression 7?i|R a n(yiQ m ) : Ran (AQ m ) —> Ran(Q m ) is its (unique) inverse. Thus, we have ||i?i|| > 

11 | Ran( AQ m ) 11 = (Ji (AQ.^ 1 and further ||R|| > \\Q m B\\ > ||i?i|| - \\Q m B - B x \\ > a 1 (AQ m )~ 1 - e. 

We conclude for sufficiently large m that cti(AQ m) -1 < ||73|| + e< ||(A + K('H))~ l \\ + 2e. Since e > 0 is 
arbitrary we arrive at linim-j.oo cri(AQ m ) > ||(A + /C('H)) -1 || -1 . Applying this observation to A* we also 
find 

lim ai(A*Q m ) > \\(A* + /C (^)) -1 || -1 = \\(A + /C(H)) - 1 |r\ 

m—too 

which finishes the proof of (110. 171) . In particular we now can apply that all of the above functions 
Bm,n, fhn, are continuous w.r.t. A, and together with the already discussed pointwise monotone conver¬ 
gence results, Dinis Theorem gives that the convergences are even locally uniform. 

We can now establish the limits in (110.14k (110. 151 ) and (110. 161 ). Notice that the limits exist since the 
respective sets under consideration are nested and uniformly bounded: Obviously, {T m ^ n ^(A)}k is de¬ 
creasing. Further {r m ,n(A)} n is increasing since, for every k, T mtn (A) C Fm,n,fe(A) C r m ,n+i,fc(A). 
Finally, {r m (A)}m is decreasing. To see this, choose z £ T m (A) and a sequence ( z n ) with z n z and 
z n € r m ,n(A), respectively. With the inclusions T mtn (A) C r min / c (A) C r m -i,rt,fc(A) we conclude 
z n € Tm—i,n(A) for every n, hence z £ r m -i(A). 

We are left with proving that T(A) equals sp ess (A). Let A £ sp ess (A). For m £ N, p, m (A) = 0 
and furthermore, there is an no(m) and a A m £ Gn 0 (m)> |A — A m | < 1 /m, // m (A m ) < l/( 2 m) and 
Bm,n (Am) < 1/m for every n > no(m). Then, for every k, Hm,n,k(^m) < 1/m as well. We conclude 
that Am 6 r m (A) C Ti(A), l = Thus the limit A of the sequence {A m } belongs to all r*(A), 
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hence to T(Al). Conversely, let A ^ sp ess (/l). Then //(z) > e > 0 for a certain e > 0 and for all z in a 
certain neighborhood U of A. Moreover there is an mo > 3/e such that /x m (z) > e/2 for all m > mo and 
z € U, hence fx m ,n(z) > e/2 for all m > mo, all n and all z € U. Further, for every m > mo and n 
there is a ko(m, n ) such that n m ,n,k(z) > e/3 > 1/mo > 1/m for all k > ko(m , n) and z £ U. Thus, the 
intersection of U and T(A) is empty, in particular A ^ T(Al). 

Step V: SCI(3 2 ,n 2 ) A < 2. Knowing a bound / on the dispersion of A obviously suggests to plug it into 
the previously defined algorithms, i.e. 


Am,n - A I y min{(Ji {Ef(n) (Al ^I')QmPn')i {Ef(n) A/) l?m-Pn)} 



Unfortunately, all we know about the functions fi m ,m Am is that they are Lipschitz continuous with Lipschitz 
constant 1 and that converge pointwise to , but not, whether or when this convergence is monotone. 
Therefore we have to make a modification in order to guarantee the existence of the desired limiting sets. 
Without loss of generality, assume that f(n) > n for every n. Let A m denote the square A m := {z £ 


C : |5i(z)|, |3(z)| < 2 ( m+1 )} and A m (A) := A + A m the respective shifted copies. Moreover, set 
Z m ■= {: s, t £ Z} and 


S m ,n( A) := {* = m + 1 ,..., n : 3z £ A m (A) D G l : < 1/ TO I 

T m ,n( A) := {i = m + 1,... ,n :3z £ A m (A) D G t : ) < 1 /(m + 1 )} 

£'m,n(A) := |S , m>n (A)| + |Tm^ n (A)| — n 


Im,n •— {A C Z m Ern,n(ty ^ 0} 


r m ,„(A):= (J A m (A). 


A 


Roughly speaking, r m ra ( J 4) is the union of a family of squares A m (A) with E m ^ n (X) being positive, which 
is the case if “most of the / l m p are small on A m (A)”. 

To make this precise, we first notice that all p. m ,i{z), i > m + 1, with z outside the compact ball K := 
/i- 211.41 :-2 (0j are larger than one, /,„ n are finite, and all T m ,n(-4) are contained in A', due to a Neumann 
argument as in the proof of Theorem l3.3l Further (i m ,n Am uniformly on K. To see this let e > 0, cover 
K by U e /z(z), z £ K, and pass to a finite subcovering U e /^{zj), j = 1,... ,1. Then choose n large enough 
such that \ fi m ,n{zj) — ^m{zj )| < e/3 for all j = 1,..., l, and find \[i m ,n{z) — Am(~)| < e for all z £ K, 
due to the Lipschitz continuity of ji rn , n and //,„. 

We will show that for each m > 5 the E mtn { A) are pointwise monotone w.r.t. n for every A £ Z m (T K, if 
n is sufficiently large. That is, for every A there is an n( A) such that either E m ^ n { A) < 0 or £ , T 71 j „(A) > 0 for 
all n > n{ A). Taking the maximum N of the finite set {n( A) : A £ Z m (T K} then yields that the r mj „(Al), 
n > N, are constant, hence converge as n —> oo. Then we denote the limiting set by L m (A), and claim 
that {T m (A)} m is a decreasingly nested sequence, hence converges as well. Indeed, let z £ r rn+ i( J 4), then 
z G r m+ i, n (A) for large n, i.e. z £ A m+ i(A) for a A G I m + i,n, i.e. A £ Z m+1 and E m+ i >rl (A) > 0. 
Clearly, there exists a Ao £ Z m with A m+ i(A) C A m (Ao), and further (since fi m .i( z ) < Am+i,i(^) holds 
whenever n > m + 1 ) 

S , m+i,n(A) = {i = m + 2,..., n : 3z G A m+ i(A) D Gt : jx m +i,i(z) < l/(m + 1)} 

C {i = m + 1,..., n : 3z £ A m (A 0 ) (T G* : p, m ,i( z ) < 1/m} = S m ,„( A 0 ) 

and analogously T m+ i jn (A) C T m>n ( A 0 ). Therefore E m+ i,„(A) < E m ^ n { A 0 ), which shows that A 0 G J m ,„ 
and thus z G r m (Al). 

So, let us now come to the monotonicity of the {-E m n (A)} n . For fixed A and m > 5 we have to consider 
three possible cases: The first one is fx m (z) > 1/m for all z G A m (A). Then there exists an no such 
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that fi m , n {z) > 1/to f° r ah n > no and all z G A m (A) (take into account that A m (A) is compact and 
Am,™ —> Hm locally uniformly), hence |S m , n (A)| + \T mtn (X)\ is constant and E m>n ( A) is monotonically 
decreasing. Secondly, assume that // m (z) < 1/m for all z G A m (A). Then there exists an no such that 
Am,n(“) < 1 /to for all n > no and all z G A m (A), hence |S , m . n (A)| = n — c with a certain constant c, 
and E m , n ( A) = |T mi „(A)| — c is monotonically increasing. Finally, assume that 1 /to belongs to the interval 
[min {n m {z) : z G A m (A)}, ma x{p, m (z) : z G A m (A)}] and notice that the length of that interval is at most 
2 -m which is less than 1/m — 1/(to +1) for to > 5. Then there exists an n o such that p, m ,n(z ) > l/(m+1) 
for all n>n 0 and all 2 G A m (A), hence {|T , mj „(A)|} n >„ 0 is constant, and T m>n (A) = (|jS' TrejTl (A)| - n) + 
|T TOiT 1 (A)| is monotonically decreasing. 

It remains to prove that the final limiting set lim m T rn (A) coincides with the essential spectrum. If 
zo G sp ess (A) then fi(zo) = 0, hence /r m (zo) = 0 for all to. So, for fixed to, we have fim,n(z ) < l/(m + 1) 
for all sufficiently large n and all z in the neighborhood tli/( 2 m)( 4 o)- Choose A G Z m such that Zq G 
A m (A) C U i/( 2 m) (zo)- This is possible since to > 5. Then it is immediate from the definitions that 
E m ,n( A) = n — c with a constant c for all sufficiently large n, hence zo G r m>n (.A) for m,n large, which 
yields that z 0 G lim m lim„ T m ^ n (A). Conversely, let zo </ sp ess (A), i.e. n(zo) > 0. Then, for large too, 
there exists an e > 3/too such that /J, m (zo) > e, hence also p, m ,n{zo) > e/2 for to > too and large n, and 
then also fim, n (z) > e/3 > 1/too for all 2 in a certain neighbourhood U of zq, which does not dependent 
on to > Too, and all large n. For all sufficiently large to > Too all A m (A) which contain z o are subsets of 
U, hence E rn n {X) = d — n with a constant d for large n, that is T m ,n(A) D U = 0. Thus zo is not in the 
limiting set. This finishes the proof. □ 

11. Proofs of theorems in SectionQ] 

Remark 11.1 (Fourier Transform). In this section we require the Fourier transform on L 2 (R d ), which 
will be denoted by T : L 2 (R d ) —> L 2 (R d ). Our definition of T is as follows: 

[^]( 0 = [ dx. 

JR d 

For brevity we may write i/j instead of Eip. With this definitions T is unitary on L 2 (R d ). 

Remark 11.2 (The Attouch-Wets Topology). In (14.2b we introduced the Attouch-Wets metric Jaw on the 
space A4 of closed subsets of C. Since it is not convenient to work with cZaw directly, we make note of the 
following simple characterization of convergence w.r.t. c?aw- Let A C C and A „ C C, n = 1,2,... be 
closed and non-empty. Then: 

(11.1) dAw(A n , A) —> 0 if and only if d/c (A n , A) —> 0 for any compact KcC, 
where 

(11.2) d/c(S, T) = max < sup d(s,T), sup d(t, S')}, 

Uesn/c tern/c J 

where we use the convention that sup se 5 n ^ d(s, T) = 0 if S fl 1C = 0. We refer to | 6 ] Chapter 3] for details 
and further discussion. Equivalently, we observe that 

(A n , 4) —> 0 

(11.3) if and only if 

Vi5 > 0, K. C C compact, 3N s.t. Vn > A, A n n /C C Ms (A) and inKc Ns(A n ) 

where Afs(X) is the usual open ^-neighborhood of the set X. In this section we will simply use the notation 
A n -A A to denote this convergence, since there is no room for confusion. 

11.1. The case of bounded potential V. 
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11.1.1. Proof of Theorem \4.2I Before we embark on the proof of the theorem the reader unfamiliar with 
the concept of Halton sequences may want to review this material. A great reference is ED (see p. 29 for 
definition). We will also be needing the following definition and theorem in order to prove Theorem l4.2l 


Definition 11.3. 

to be 


Let {t\,... t]\r} be a sequence in [0, l] d . Then we define the star discrepancy of{t\A n} 




sup 

ife/C 


1 

~ v(K) 


where K, denotes the family of all subsets of [0, l] d of the form IIl-ilP’ bk), Xk denotes the characteristic 
function on K, bk £ (0,1] and v denotes the Lebesgue measure. 


Theorem 11.4 ( (70)). If {tfc}fceN is the Halton sequence in [0,1 ] d in the pairwise relatively prime bases 
b \,..., bd, then 


(11.4) 




d_ 

N 



1 

21og(6 fc ) 


log(iV) + 


bk + 1 ^ 


N £ N. 


For a proof of this theorem see (7Q), p. 29. Note that as the right hand side of (II 1.41) is rather cumbersome 
to work with, hence it is convenient to define the following constant. 


Definition 11.5. Define C*[b \,..., bf) to be the smallest integer such that for all N £ N 


d 

N 


1 d 


where b\ 


N fc=i v 21 °g( & fe) 

, bd are as in Theorem U 1 ,4\ 


bk — 1 , , , n bk + 1 

log (N) + —— 


< C*(bi,..., bd) 


log (N) a 

w~ 


Further to these definitions, we shall require a Gabor basis which is the core in the discretisation carried 
out to produce the tower of algorithms. In particular, let 

(11.5) ip k j(x) = e 2mkx X[o,i](x-l), k,l£Z. 

It is well-known that ipk,i form an orthonormal basis for L 2 (R). Thus, by applying the Fourier transform, 
(11-6) {V’fci.ti ® i>k 2 ,i 2 ® ® 'fikd.id ■ ki,h, ...,k d ,ld& Z} 

forms an orthonormal basis for T 2 (R d ) since the Fourier transform T is unitary. Let { }jeH be an enumer¬ 
ation of the collection of functions above, define 


(11.7) 5 = span{<^}j SN 
and let 

(11.8) 6 : N 9 j i —> (fci, li ) x ... x ( kd , Id) € Z 2d 

be the bijection used in this enumeration. Define 

k(m,d ) :=max{|)c p | : {k p ,l p ) = 0(j) p ,p £ {1 £ {l,...,m}}, 

(11.9) 

l(m,d) := max{| l p \ : (k p ,l p ) = 6(j) p ,p £ {l,...,d},j £ {1,.. .,m}}, 

and let 

( 11 . 10 ) 


Ci(m, d, a) := d 2 4 


l2 | (max{Z(m, d) 2 + l(m, d) + 1/3, l }) 2 


m,d,a £ N, 


| a — k(m , d)\ + 1 J 

( 11 . 11 ) C 2 (m, d) := 2 d ( 2 (([(to, d) + l) 4 + Z(m, d) 4 ) 2 (2(fc(m, d) + 1) + 2)) * , m, d £ N. 
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The quantities Ci(m, d , a) and C^to, d) may seem to come out of the blue. They stem from Lemma fl 1.71 
and Lemma fl 1.8 1 that are technical lemmas needed in order to construct the tower of algorithms. However, 
Ci (to, d, a) and C 2 (to, d) occur in the main proof and thus it is advantageous to introduce them here to 
prepare the reader. 

Remark 11.6 (Assumptions on A). As mentioned in Remark |4~T1 we will now specify the assumption on 
the constants in A. In particular, A will include 

{0(j) P : P < d,j £ N} U {C*(bi,b d )} U {log(km)}kLi U {<A(fc)}£°= i, 

where <f> is the function describing the bound on the local bounded variation in (14.31) . Moreover, A will also 
include 

(11.12) |^r i (0;eeM, fc,(ez, s = o,2 

A Q 2 1 

Note that it is easy to derive closed form expressions for ipk,i and ' , and these expressions will be 

variations of products of exponential functions and functions of the form x l/x p for p = 1,2,3. For any 
of the general algorithms T : Q. —> A4 (where O is the appropriate domain), used in the lowest level of the 
tower, will satisfy assumption II in Remark f2.1 II In particular, the constant functions in Ap( A) are the same 
for different inputs A. B £ V.. 

With these preliminaries we are now ready to prove Theorem l4.2l which will be done in several steps. 

Proo/o/SCI(Si, 0 2 ) a = 1. We split the proof of this upper bound into several steps to simplify its presen¬ 
tation. In the first two steps we define the tower of algorithms containing Y m . In the third step we show that 

lim m _^oo T m — -4.1. 

Step I: Defining r m ({V r p } pg A rm (t/)) and Ar m (V). To do that recall S from (II 1.71 ). Note that since 
T>(H) = W 2,2 (R d ) it is easy to show that S is a core for H. Let P m , to £ N, be the projection onto 
and let z 6 C. Define 

S m {V,z) := {-A + V-zI)P m and S m {V,z) := (-A + V - zI)P m . 

Let 

<Ji{S m {V,z)) :=min {((S m (V, z)f, S m (V, z)f)) 1 * : / e Ran(P m ), ||/|| = 1} 
and ai(S m (V, z)) := min{((5 m (H, z)f, S m (V, z)f))i : f £ Ran(P m ), ||/|| = 1}, and define 

(11.13) 7 m{z) := min{(Ti(S' m (y, z)), ai(S m (V, ^))}}- 

Note that if we could evaluate at any point z using only finitely many arithmetic operations of elements of 
the form V{x), x £ we could have defined a general algorithm as desired by using ( 7 ™,) where 

Tg“o) is defined in (IIP.II ). Unfortunately, such evaluation is not possible ( 7 m may depend on infinitely 
many samples of V), and we will now focus on finding an approximation to 7 m . 

Let S = {ffejfeGN- where tk £ [0, l] d is a Halton sequence (see f70l p. 29 for definition) in the pairwise 
relatively prime bases 61 ,..., b d (note that the particular choice of the b 3 s is not important). Define, for 
a > 0 and N £ N 

(0n)d N _ 

(11.14) { f,g) a N = ^J-J2f a (tk)g a (tk), f,geL 2 (m. d )n BV loc (K d )), 

k=\ 

where we have defined the rescaling function on [ 0 , I \ rl by 

(11.15) f a = /(a(2 • —1),..., a(2 • —1))| [ 0 i 1 ] <j, 

(we will throughout the proof use the superscript a on a function to indicate (II 1.151 )). where BVi oc (R rf )) = 
{/ : TV(/|[_ h b ]d.) < 00 , V 6 > 0} and TV(/|[_b,b]< i ) denotes the total variation, in the sense of Hardy 
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and Krause (see ll70l ). of / restricted to [—b 7 b] d . Note that since V G L°°(R d ) D BVi oc (R d ) and any 
/ G Ran(P m ) is smooth we have that S m (V,z)f G L 2 (K d ) fl BVi oc (R <1 )). Hence, we can define for 
n, m G N 

cri,n(S m {V, z)) := m.in{((S m (V, z)f, S m (V, z)f) n N ( n) )i : f G Ran(P m ), ||/|| = 1} 

(11.16) „ „ 

0-1 ,n(S m (V, z)) := min{((S' m (y, z)f, S m (V, z)f ) n<N ( n) ) 2 : / G Ran(P m ), ||/|| = 1}, 
where N(n) [n</>(n) 4 ] and where <f> comes from the definition of f2i. Let 

(11.17) (m(z) ■= min {k/m : k G N, k/m > z)), 2 ))}}, 

(11.18) n(m ) := minfn : f(m,n ) < — 5 -}, 
and 

f (to, n) := (m + 1 )toCi (to, d, n) 

(11 19) + (w 2 + crV(n) + 2 (ctto + 1 )(</>(ra) + 1 )) 

x (l + o - 2 + 2cr) C 2 (rn,d)C*(bi,... ,b d ) ^ 0S ^^ n ^ , N(n ) = |"?t</>(n) 4 ], 

iV (n) 

a = 3 d — 2 d+1 + 2, Ci( to, d, n) is defined in (II 1.10b . C 2 (to, d) is defined in (II 1.1 lb and C* ( 61 ,..., bd) 
is defined in Definition II 1.51 First, note that the choice of N{n) in ( II 1.19b implies that f(m,n) —> 0 as 
n —> 00 . Thus, n(m) is well defined. Second, note that it is clear that f, and hence also n(m), can be 
evaluated by using finitely many arithmetic operations and comparisons from the set 

(11.20) Ai = {9{j) p : p < d,j < to} U {C*(b l7 ..., b d )} U {\og{k(/){k))} r k=1 U {cj>(k)} r k=1 , 

where r is some finite integer depending on m. Recall from Remark fl 1 , 6 l that we have that Ai C A. 

The function f may seem to come somewhat out of the blue, however, it stems from certain bounds in 
(II 1.41b (see also (II 1.42b ) on errors of discrete integrals related to (II 1.16b . We can now define 

r m(v) = rmdLpJpeArm)^)) := ^B m ( 0 )(Cm), 

where is defined in ( 1 10 . lb and 

(11.21) Ar m (R) = {p x : x G L m } U Ai U A 2 . 

Here {p x : x G L m } is the set of all point evaluations p x {V) '■= V ( x) at the points in 

L m := {{n{2t k ,i - 1),..., n(2t M - 1)) : k = 1,..., N(n) = \ 'n^(n) 4 ], n = n{m)}, 

where t k = {tk,i, ■ ■ ■, tk,d}, n = n(m) is defined in (II 1.18b and A 2 is a finite set of constant functions that 
will be determined in (II 1.27b in Step II. 

To show that this provides an arithmetic tower of algorithms for Si note that each of the mappings 
V | —> I’m(L) is an algorithm as desired for arithmetic towers of algorithms. Indeed, Ap m (V) is finite and 
does not depend on V, hence we have a non adaptive tower. Moreover, the computation of t ((«,.) 
requires only finitely many evaluations of ( m , hence it suffices to demonstrate the following. 

Step II: For a single :GC, the evaluation of £ m (z) requires finitely many arithmetic operations of the 
elements {V}}peA rm (v)- To see this we proceed as follows. For zGC, form the matrices Z m (z), Z m (z) G 
C mxm by considering the orthonormal basis {} :;( =n constructed in the beginning of Step I. More precisely, 

Z m (z)ij = (S m (V,z)ipj,S m {V,z)(pi) n<N , i,j < TO, 

Z m (z)ij = {S m (V, z)ifij,S m {V, z)ipi ) n , N , i,j < to, N = N(n ) = |"n^(n) 4 ], 


( 11 . 22 ) 
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where n = n(m) is defined in (II 1.18b . Note that forming Z m (z)ij and Z m {z)ij require only finitely many 
arithmetic operations and radicals of the elements {fp} pg A r (v)> where we will now specify A 2 in (II 1.21b . 
Indeed, 

Z)(pj ■ A m ( V, n,N = (A( Pj 5 Pj ; ^Pi)n : N Pj ■> V Pi)n,N 

(11.23) + (Vipj,Vipi) n , N - 2'8t(z)({Aip j , <Pi) n , N 

T (Vpj j Pi)n,N') "1“ \z\ {jPjiPi)n,N- 


for i, j < m. Observe that for s,t £ {0,1}, s, t £ {0, 2} and g £ {V, V, \V\ 2 } it follows that 


(11.24) 


(g AV^AVikiv = ^E( 9 n (t k ) £ 


N 


N 




i, j < m, 


fc =1 


pG-t>(s),ge$(t) 


hi,j,p,q{ x ) l 1 p9(j)i{ x l) ' ' ' Q x s ( x p) ' ' ' ( x d) 


(11.25) 


X I ^e(ih( x 1) ■■■ d g e J )q {Xq) • • • Mi)A x d) I , 


(11.26) 


m 


{l,...,d}, f=l 

{1}, t = 0, 


where s = 2s and t = 2 1. Note that because of the choice of in ( II 1.5b we have explicit formulas 
for -ipg(j) p and q°s j)p that are variants of exponential functions. Thus, since n(m) can be evaluated with 
finitely many arithmetic operations and comparisons of the elements in Ai, and by (II 1 .241) . (II 1.251) and 
(II 1.261) . it follows that (jA s (pj,A f (pi)n,ff can be evaluated by using finitely many arithmetic operations 
and comparisons of elements in {p{V) : p £ Ar m (H)} where Ar m (H) is defined in (111.211) and 


(11.27) 


A 2 = 


9 ~ s Mj) p 

dxl 


£ L m , 1 < j < m, 1 < p < d, s = 0,2 


(As discussed in the assumption in Remark 12. 11 1 we treat the numbers in A 2 as constant functions on il). 
Hence, it follows that forming Z m {z)ij requires only finitely many arithmetic operations and comparisons 
of the elements {lfo} pg Ar m (v)- The argumentfor Z m (z)ij using (S rn (V, z)ipj, S m (V : z)ipi) a ,N is identical. 

When Z m (z) and Z m (z) are formed, we proceed as follows in order to compute ( m (z). For k £ N, we 
start with k = 1 , then: 

• Check whether mm{ai(Z m (z)), ai(Z m (z))} < k/m. 

• If not let k = k + 1 and repeat, otherwise ( m (z) = k/m. 

Note that the first step requires finitely many arithmetic operations of {Z m (z)ij}ij< rn and {Z m (z)ij}ij< m 
by Proposition 110.11 and the loop will clearly terminate for a finite k and thus compute ( rn (z). Hence, 
we have proven the assertion that the evaluation of ( m (z) requires finitely many arithmetic operations and 
comparisons of the elements {Vp} p( z k Tm (v) and we conclude that F m are general algorithms as desired for 
arithmetic towers of algorithms. 

Step III: Finally, we show that r m (H) — > Hi (V'). as n — > 00 . Note that, by the properties of the Attouch- 
Wets topology, and as discussed in Remark fl 1 .21 it suffices to show that for any compact set K. C C 


(11.28) 


tfc(r m (H),Ei(F))—k 0, n -)> 00 , 


where die is defined in (II 1.21 ). To show (II 1.28b we start by defining 

7 ( 2 ) := min { inf{||(—A + V — zl)ip\\ : t/> G W 2 ’ 2 (R d ), ||^|| = 1}, 

inf{II(—A + V — zl)ip\\ : ip £ W 2 ’ 2 (R rf ), U\\ = 1}} = ||(-A + V - zl)- 1 U" 1 , 


(11.29) 
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where we use the convention that ||(—A + V — z /) _1 || _1 = 0 when z £ sp(—A + V) and proceed similarly 
to the proof of Theorem 13 .3 1 with the following claim. Before we state the claim recall h from the definition 
of (£) in Step I of the proof of Theorem l3.3l 

Claim: Let A3 C C be any compact set, and let K be a compact set containing A3 such that sp(- A + 
V) (T K 0 and 0 < <5 < e < 1/2. Suppose that £ is a function with ||£ — 7 ^ ^ := ||(£ — 7 )Xxl|oo < e 
on K ■= (K + Sh(diam(A:)+ 2 €)+e( 0 )) I where \k denotes the characteristic function of K. Finally, let u be 
defined as in (110.41 ). Then lim^o u(£) = 0 and 

(11-30) d K (T^(C),s P (-A + C))<u(e). 

To prove the claim, we first show that 

(11.31) sup dist(s, sp(—A + V)) < u(e). 

aeT^(C)n/c 

If T s k (£) n A3 = 0 then there is nothing to prove, thus we assume that Y 1 ^ (£) (T A3 0. Let z £ G S (K ) 
and recall G S (K), hg and I z = (T (<S(Z + iZ)) from the definition of Yj<.(£) in Step I of the 

proof of Theorem 13.31 Notice that we may argue exactly as in (110.51 ) and deduce that I z C K. Suppose 
that M z ^ 0. Note that by (14.41) . the monotonicity of h, and the compactness of sp(—A + V) D K ^ 0 
there is a y £ sp(—A + V) of minimal distance to z with \z — y\ < h('y(z)). Since ||£ — 7 H ^ < e, and 
by using the monotonicity of h, we get \z — y\ < /i(£(z) + e). Hence, at least one of the v € I z , say vq, 
satisfies |i>o — y\ < h(((z) + e) — /i(£(z)) + 25. Thus, by noting that 7 ( 110 ) < dist(uo, —A + V), and by 
the assumption that 5 < e, we get £( 70 ) < 7 ( 110 ) + e < /i(£(z) + e) — /i(£(z)) + 3e. By the definition of 
M z , this estimate now holds for all points w £ M z . Thus, we may argue exactly as in (110.61 ) and deduce that 
dist(w, sp(.A)) < /i(/i(£(z) + e) — h(£(z)) + 3e) which yields (II 1.31b . To see that 

(11.32) sup dist(Y|-(£), t) < u(e), 

t£ sp(—A+y)n/c 

(where we assume that sp(—A + V) D A3 ^ 0) take any y £ sp(—A + V) D A3 C K. Then there is a point 
z £ G S (K) with \z — y\ < 5 < e, hence £(z) < 7 (z) + e < dist(z, sp(—A + V”)) + e < 2e < 1. Thus, M z 
is not empty and contains a point which is closer to y than ft,(£(z)) + e < h(2 e) + e < u(e), and this yields 
(111.321 ). The fact that lim^o u(£) = 0 is shown in Step II of the proof of Theorem l3.3l and we have proved 
the claim. 

Armed with this claim we continue on the path to prove (11 1.281 ). We define 

(11.33) 7 m,n(z) := min{di ! „(S' m (l/, z)),a^ n {Sm{V,z))}. 

Then £ m = 7 m, n (m) where n(m) is defined as in (11 1.18b . By Lemma fl 1 .91 £ m —> 7 locally uniformly, 
when to — > 00. Let too be large enough so that £? mo (0) D A3. Then, for all m > mo, T m (V) D A3 = 
T// m ( 0 )(Cm) D A3. Choose K = B mo ( 0) and e £ (0,1/2) as in the claim. Then, by the claim, there is an 
toi > too such that for every to > mi, by (11 1.30b . d/c(Xm(V), Eu(V')) < u(e). Since lim^o w(£) = 0 
then ll 1 . 281 ) follows. □ 

To finish the proof of Theorem 14.21 we need to establish the convergence of the functions 7 ™, £ m and 

Lemma 11.7. Consider the functions 7 m> „ and 7 m defined in Ml. iil ) and Ml .fib respectively. Then 
r )m,n —^ 7 m , locally uniformly as n —> 00 . 

Proof. Note that we will be using the notation TV[_ a a ]d(/) = TV(/|[_ a a ]d). Let, for s,t £ {0,1}, 
i,j <m and g £ {V, V,\V\ 2 } 

I {g,A s (p j ,A t tp i )= g(x) ^2 Kj,p,q( x )dx, 

pe*(«),gea>(t) 
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where hij iPtq is defined in i ll 1 .25b and $ is defined in i ll 1 .26b (recall that {w}jgn is an enumeration of 
{M,h 2 : ki,h, ...,kd,ld G ZlfromlEJ.). Observe that by the definition of 7 m> „ 

and 7 m in ( II 1 .33b and 111 1.13b the lemma follows if we can show that 

(11.34) I(g, AVj,AVi)-— E 5 ”^) E Kj, P ,q(tk)) —>0, n-> oo, 

fc=l pG4>(s),gG$(t) 


where N = N(n) is from 111 1.22b . i,j < m, s,t £ {0,1} and g is either V,V, \V\ 2 (recall the notation 
V a from 111 1.15b ). Note that, by the multi-dimensional Koksma-Hlawka inequality (Theorem 2.11 in 1701 ) it 
follows that 


i/foAVi, avo - £>"(**) E K d , P jt k )\ 


N 


N 


(11.35) 


fc =i 


pG$(s),9G<I>(i) 


— 111? ^ ' hi t j t p,qXR(n) IIL 1 + TV[_ nr ,]d j g ^ ' hi,j,p,q I D N (ti,. . . ,tm), 


pG<t>(s),gG$(t) 


pG3>(s),g£4 > (t) 


where R[n ) = ([— n, n] d ) c . To bound the first part of the right hand side of 111 1.35b note that 


(11.36) 


where 


9 'y ] ^ijtPtqXRin) 

PG4>(s),9G3>(t) 


— Ill?ll (tr), 

L 1 


K\j(n) 


E 

pG$(s),gG$(t) 



(j)i 


d ~ s i’6(j) P 

dx s p 


•••V , e(i)J,|V i 0(i) 1 



(recall 6 from 111 1.8b ) where X([-n,n] d ) c denotes the characteristic function on ([— n, n\ d ) c . To bound iCj(n), 
note that it follows by the definition of ip^i with k, l £ Z in 111 1.5b and some straightforward integration that 

for 1 < p < d and (k p , l p ) = 9{j) p we have 


(11.37) 



( 1 

M,l p { X P ) 

VI 


^ |aip—fcp|+l 


when k p — 1 < x p < k p + 1, 

otherwise, 


(11.38) 


Hence, if 


d 2 i>k p ,i p 

dx% 



< 


;2 I ; i I 

tp -I- ip -I- 3 

(p+ip + 5 

\x p — fep|+l 


when k p — 1 < a; p < k p + 1, 

otherwise. 


k = k(m, d) := max{|fc p | : ( k p ,l p ) = 9(j) p ,p £ {1,... ,d}, j £ {1,... ,m}}, 
l = l(m,d) := max{| l p \ : ( k p ,l p ) = 9{j) p ,p £{!,... ,d},j £{!,... ,m}}, 


and n > k, then it follows that 
K 


(11.39) 


i,i( n ) ^ max {(l X({-n,n] d )4e(j)i' ■ ■ 9 ■'' Mi)* 

\Mi)i ■ ■ ■ 9 ' ’' MM ) : P e ®{s),q £ $(t), s, t £ {0,1}| 

<Ci(m,d,n), Ci( to, d, n) = d 2 ( 4- 


2 , (max{! 2 + ( + 1/3, l}) 2 


■ — fc| + 1 
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To bound the second part of the right hand side of (II 1 .35b observe that, by Lemma fl 1.81 we have 


TV[_ n , n] - g n h lj,v, q 

\ pe4 > (s),g£ < t'(t) 

(11.40) < d 2 (||</|| 00 ||^-, p ,,|| 00 + a 2 TV [ _„ i „ ] ,( 5 )TV [ _ riin]d (/i iJ , p , 9 ) 

T tr (TV[_„ in ]d (g) || hij t p,q || oo + TV[_ n rl ]d (hi,j,p,q) Halloo) ) 
<max{||y|| 00 ,||l/ 2 || 00 ,TV [ _ nin]d (^),TV [ _ nin]d (|y| 2 )}(l + a 2 + 2a)C' 2 (m,d), 


where cr = 3 d — 2 d+l + 2 and C 2 (m, d ) is defined in (111.1 II) . Thus, by (II 1.35l >. (II 1.36b . (II 1.39l i. (Ill .40b . 
Lemma fl 1.8 1 and Theorem ll 1.4l (recall that {tk} kan is a Halton sequence) we get 


\I(g, AVi.AVi) - E Kj, P , q (tk) I 

k— 1 p£<3>(s),g£<I>(£) 

<max{||y|| 00 ,||F||^}C' 1 (m,d,n) + ma X {||y|| 00 ,||'L 2 || 00 ,TV [ _ ni „ ] ,(F),TV [ _„ i „ ]d (|y| 2 )} 


x (1 + CT 2 + 2 cr) C 2 (to, d) ( ^ f , log(iV) 1 ^ + 1 


(11.41) 

V ( 1 -L /t^ -U 9/t^ (m rl\ I 

E 1 2 

< T(||V||oo,m,?i), 

where the last inequality uses the bound on the total variation of V from (14.31) and 

T(||V||oo,m,n) := (||V||oo + 1 )|| V||ooCi(to, d, n) 

(11.42) + (Halloo + 0 ’ 2< / >2 ( n ) + 2 (o'||f / || 00 + l)(<j)(n) + 1 )) 

x (l + a 2 + 2a) C 2 (rn,d)C*{b 1 ,... ,b d ) ^° g ^ , N(n) = \nc/)(n) A ~\ 


(recall (II 1.16b ) where C*(bi, ..., bd) is defined in Definition II 1.51 Finally, note that, by the definition of 
Ci( to, d, n) and the fact that we have chosen N(n) according to ( II 1.42b . it follows that "rdf V^Hoc, m, n) 0 


as n 


—> oo. Hence, (II 1.34b follows via (II 1.42b . and the proof is finished. 


□ 


Lemma 11.8. For all a > 0, i, j < n 2 and m,n < d: 

(i) < C 2 (to, d), 

(ii) \\K,j,m, n \\oo<C 2 (m,d), 

(iii) for g £ BVi oc (K d ) and a = 3 d — 2 d+1 +2 we have that 

TV((? hi,j,p,q) — ||fl , ||oo||^iJ,p,g||oo + O' TV[_ aa ]<i(g()TV[_ aia ]<j(/li i j i p i q) 

+ a (TV [ _ 0i0] d(5)||/l i j,p,,||oo + TV [-a,a] d (hi,j,p,q) llfi'lloo) i 

(iv) TV[_ a , a] 4(| 5 | 2 ) < II^Hto + °' 2 TV 2 _ a a] d(g) + 2CT||p|| 00 TV [ _ a a] d(g) 

where 

C 2 (to, d) := 2 d (2 ((! + l ) 4 + l A f{2{k + 1) + 2)) d , 
and k, l are defined in ( 177.9b . 

Proof. To prove both (i) and (ii) we will use the easy facts that TV(/i“ ■ ) = TV[_ a a ]<j(/ij i:;i p )9 ) and 

TV(g a /i“ J - ) = T V[- a ,a] d (ghij,p,q). To prove (i) of the claim let us first recall (see for example li70l , p. 
19) that when ^ £ C 1 ([—a, a\ d ) then 

d 

TV[_ o , a ].i(V0 = E E 

k —1 l<ii <...<ife <d 


(11.43) 
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where (V’ii,.■■,**) and 

V’h : (yi,---,yk) yj = a, j m, =yj, 


/ a na 

... / 

-a «/ — a 


<9V 


dx\...d,Xk, <f G C' 1 ([—a, a] fe ). 


ctei•••ctafc 

Note that from (II 1.251) and (II 1.51 ) it follows that /i“ ? p q G C°°([0, l] d ), so by the definition of h in ( II 1.251 ) 
we have that, for fc e { 1 ,..., d} and 1 < ii < ... < i k < d, 

y {k \Kj , P , 4 ) 


(11.44) 


— n max 

M—1 


max / 

s,t=0,2j_ n 


max 

s,t = 0,2 

x^G[—a,a] 


l ftr* a** ^ 




ds M^ (T P'Mi) 


9xf 


a®* (Xm) 


Vfc,p, q < d. 


We will now focus on bounding the right hand side of (II 1.441 ). Note that by using the definition of with 
k,l G Z in (II 1.51 ) and some straightforward integration it follows that for 1 < p < d and (k p , l p ) = 9(j) p 
we have 


(11.45) 


(11.46) 


di’k p ,l 


(Xp) 


dx p y p> 


9 3 ^k p ,i p 


( x p ) 


dxl y p> 


< 


< 


l p + \ when k p — 1 < x p < k p + \, 


l +h 

\xp— fepl+1 
((p+l) 4 -(p 


otherwise, 


when kp ~ 1 < Xp < kp + 1, 


- \ (h+ir-r P 


otherwise. 


4(|x p — fcp| + l) 

Thus, by using (II 1.371 ). (II 1.381) . (II 1.451) and (II 1.461 ) it follows that 


max 

s,t— 0,2 


f 

— 1 

/-« 

9x p \ 


/ \ ^ , X 


dxj. 


dx.j 


(11.47) 


< 2 max 

s,t—0,1,2,3 J 


ds ^ ( )^tm l (x ) 


da;?, 


dxl 


dx u 


< 2 (([+l ) 4 + [ 4 ) 2 ^ 2 (fc + l) + 


f ~2 dy 

[—oo, —l]U[l,oo] V j 


= 2((l + l ) 4 + f 4 ) 2 ( 2 (k + 1 ) + 2 ) , 

where k := max{|fc p | : (k p ,l p ) = 9(j) p ,p G {l,...,d},j e {l,...,n}}, [ := max{|Z p | : (k p ,l p ) = 
9(j) p ,p 6 {1, • ■ •, d},j G {1,..., n}}. Moreover, by (111.37b and (111.38b 


(11.48) max 

s,t = 0,2 

x M £[—a,a] 




dxf 


9x1 


< max{P + f+ 1/3,1}, i,j<rn , 1 < p < d. 


Hence, from (II 1.44b . ( II 1.47b and (II 1.48b it follows that for k G {1, ... ,d} and 1 < i\ < ... < i k < d, 

V (k) (hl j>p<q -, h,...,i k )< (2 ((i + l ) 4 + f 4 ) 2 ( 2 (fc + 1 ) + 2 )) “ 

and thus, by (II 1.43b we get that 

T< ( 2 ((f+l ) 4 + [ 4 ) 2 ( 2 (fc + l) + 2 )) d ^ (f) 

h —1 v / 


< 2 * 


fc=i 

d 


(2((f + l) 4 + P) 2 (2(fc + 1) + 2)) , 
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and thus we have proved (i) in the claim. 

To prove (ii) in the claim, we observe that by (11 1.5I >. (11 1 .25b and (11 1 .48b it follows that 


II h 


a 

i,3,P,Q 


d 


n 


max 

s,t= 0,2 


[—00,00] 


dx^ 




dxl 





for i,j<m and p, q < d. Obviously, the last part of the above inequality is bounded by C(m, d). which 
yields the assertion. 

To prove (iii) and (iv) we will use the fact (see lfT3lD that 


A = {f € M{[-a,a] d ) : ||/||oo + TV[_ a , o] d(/) < 00 }, 

where A i([—a,a] d ) denotes the set of measurable functions on [— a,a\ d , is a Banach algebra when A is 
equipped with the norm \\f\\A = ll/lloo + erT V[_ 0 ,o] d (/)i where cr > 3 d — 2 d+1 + 1. We will let cr = 
3 d - 2 d+1 + 2. Hence, we get, by the Banach algebra property of the norm and (i) and (ii) that we already 
have proved, that 

TV[_ a^ajd. {ghi,j,p,q) f || ff|| oo || II oo "T O’ TV[_ 0|0 ]d (<7)TV[_ a)0 ]rf (/lj 

+ tr (TV [ _ aia] d(ff)||/i ijjiPig || 0O + TV[_ aja ]d(/iij-p j g)||p|| 00 ] , g £ A, 

finally proving (iii). The proof of (iv) is almost identical. □ 


Lemma 11.9. Let be defined as in ( 1/ /. / 71 ). Then, Cm —^ 7 locally uniformly, where 7 is defined in 

mm- 


Proof. Let y rn be as defined in (11 1.131 ). Also, observe that y rn — y 7 locally uniformly asm -t 00 . Indeed, 
let T = {||(—A + V + xl)tp || : ip £ fT 2 , 2 (R <i ), ||V’|| = 1}- Then, since S is a core for H (recall S from 
Step I of the prof of SCI(Hi, £> 2 ) a = 1) then every element in T can be approximated arbitrarily well by 
|| (—A + V + xl)(p\\ for some if £ S, thus it follows from (111.29b that we have convergence. Note that the 
convergence must be monotonically from above by the definition of P m , and thus Dini’s Theorem assures 
the locally uniform convergence. Thus, it suffices to show that |Cm — 7m | —;* 0 locally uniformly as m —> 00 . 
Note that if we define, for z £ C, 

— (*Sm(A, Z~)fj , SmiV, tfijfff n,N ; fj A ttl, 

Zm(z)ij = {S m (V, z)f j ,Sm(V, z)fi)n,N, i,j<m, N = \n<p(n) 4 ] , 
where n = n(m) is defined in (11 1.181 ) and 

W m (z)ij = ( S m (V,z)fj,S m {V,z)fi), i, j < m, 

W m (z)ij = (S m (V,z)fj,S m (y,z)fi), i, j < m, 

the desired convergence follows if we can show that \\Z m (z)— W m (z)\\ and \\Z m (z) — W m (z)\\ tend to zero 
as m tends to infinity for all z in some compact set. However, this follows by the choice of n(m) = min{n : 
f(m,n) < -4l in (111.181 ). In particular, f(m, n) — T{m,m,n) and clearly rdlVUoo, m, n) < r{m,m,n) 
for || V11 00 A rn (recall r from (111.421 )). Thus it follows immediately by (11 1.411 ) that for z £ K C C, where 
K is compact, ||Z m (,z) - W m {z)\\ F = O(^) and || Z m (z) - W m {z)\\ F = 0(±) for sufficiently large to. 
Here || • ||f denotes the Frobenius norm, and we have shown the desired convergence. □ 

Proof of SCI(E 2 , f2i) A < 2. The proof stays close to the proof of SCI(Si, H 2 )a = 1. As before we split 
the proof into several steps, however, before we embark on the different steps recall that ^{V) = sp f (//), 
the setup in Step I of in the proof of SCI(Ei, f^A = 1 and define 7 : C —> [0, 00 ) as in (111 .29b . for m £ N, 
7 m : C —> [0, 00 ) as in (111.131 ) and, for to, n £ N, 7 m ,n ■ C —> [0, cxo) as in (111.331 ). 
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Step I: Defining r m , and Ar m> 7 l (V'). We start by defining, for A E G m := (m 1 (Z+iZ))ri-B m (0), 

5 m , n (A) := {i = m + 1 ,..., n : 7 m ,»(A) < e - — } 

m 

L49) T mA A ) := {* = m + 1 ,..., n : 7 m ,i(A) < - Li , / } 

(e — 1 /m) 1 + 1 /m 

-£'m,n(A) := |5 mi „(A)| + |T , iriiri (A)| — n. 


Recall that 'y m ,n(z) = min{( 7 i,„(S m (V; z)), CTi, n (S m (V; z))}, where cri, rl (S' m (lA, z)) and tri )n (S m (V, z)) 

are defined in (111.16t . Thus, we can define 


(11.50) 


r m,n{V) — r mjn ({V/}pg Ap m n (V)) {A £ G m \ E m ^ n (X) > 0}, 

r m (F) = lim r m ,„(y). 


To determine Ap m n (V) we observe that by the definition of Q m in ( 11 1.17b and the definition of 7 m , n in 
(11 1 .3311 the evaluation of requires the same information as the evaluation of with the exception of 
Ai in (ITTT201) . Thus, 

A r m ,„(V0 = {p x : p x {V) = V(x),x £ L n } U A, 


where 


A — s p x 


d ~ s ^eu) P 

dxl 


: x £ L n , 1 < j < m, 1 < p < d, s = 0,2>, 


L n := {(n(2f fej i - 1),.. .,n(2t k ,d - 1)) : k = 1,..., |"n</>(n) 4 ]}, t k = {t k , 1 , ■ • - ,4,d} 

and ipg(j) p is defined in (11 1.51 ). Note that showing that the limit in (111.501 ) exists is part of the proof. Note 
also that by arguing exactly as in Step II in the proof of SCI(Si, = 1 , it follows that evaluating 7 m ,n 
requires finitely many arithmetic operations of the elements in { V f ,} f) c a, (V) ■ Hence, it is clear that T m , 
L m ,n form an arithmetic tower of algorithms. To finish we only need to show the following. 

Step II: We have that 


(11.51) 


rm,n({^p}/j£Ar m| „(V)) ^ L' m {V), Tl > OO, 


(11.52) 


r m (V) — >E 2 (V), m —> 00 . 


We start by showing that T mtn (V) = T m ,i{V) for all large n and l, thus, the limit exist and we have 
(11 1.511) . To see this, we start by claiming that for each A £ G m then either /i’ m n (A) < 0 for all large n or 
E m ,n{ A) > 0 for all large n. Observe that if the claim holds this implies that r m>n (V) = r m ;(!A) for all 
large n and l. To prove the claim consider the three cases: 


(1) : 7 m (A) < 


(e — 1/m) -1 + l/r 


( 2 ): 


(e — 1/m) -1 + I /7 


< 7m (A) < 


1 

m’ 


(3) : e —— < 7m(A). 
m 

In all cases it is the locally uniform convergence 7 m ,n —> 7 m established in Lemma fl 1 ,7l that is the key. We 
start with Case 1: In this case 7 m ,n(A) < ( e _ 1 / m ) L -i +1/ / m for large n, so|5 TOi „(A)| = n-c\ and|r mi „(A)| = 
n — C 2 , ci,C 2 £ N, yielding the claim. Case 2: This case has two sub-cases. If = 7 m(A) 

then for large n, 7 m,n(A) < e — A so |jSm )Tl (A)| = n — c for some positive integer c. Now, either |T m , n (A)| 
stays constant for large n or it grows. In either case E m}n (A) = |T m>n (A)| — c cannot change sign for large n, 
thus, the claim holds. When (e _ l/m jLi +l/m < 7m (A) < e - A then < 7m, n (A) for large 

n, so |T m ,n(A)| = r, r £ N for large n. Note that |S' m , n (A)| — n cannot change sign for large n, thus, the 
claim holds. Case 3: In this case e — A < 7 m ,n(A) for large n. Thus, |<S' 77 T.,n(A)| = <7 and |T m ,n(A)| = c 2 
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for large n, yielding the claim. As a side note, we observe that due to reasoning in the three cases above we 
have that 


(11.53) 


r m (V) — { Z £ Grn ■ ^ 


(e — 1 /to) -1 + I/to 

L C {z £ Gm : 7 m(z) <€ - 1/m}. 


}U L, 


To see (111 .52b we observe that, by definition, 7 ( 2 :) < 7 m (z) for all z £ C. Thus, we immediately 
have, by (111.53b . that T m (V) C 52 (V) = cl ({2 : 7 (z) < e}). As we are considering the limit in the 
Attouch-Wets topology we are only left with the task of showing that for compact set K C C and <5 > 0 
then £ 2 (V) D K C A//(T m (l/)) for large m. We assume that 52 (V) D K 0 otherwise there is nothing 
to prove. To prove the assertion we argue by contradiction. If the statement is untrue, then there is a strictly 
increasing sequence {mfe}fc 6 N C N such that there exists a sequence {.Zm fc }fceN C 52 (V) D K such that 
z mk </ Jf s {Tm k (V)). By compactness of ^(V) (T K, and by possibly passing to a subsequence, we have 
that Zm k —> z for some z £ 5 2 (V) fl K, however, z ■Afs/ 2 (J'm k (V)) for large k. Since z £ 5 2 (V) D K 
there is a 27 £ C such that 7 ( 2 : 1 ) < e and \z — z\\ < 6/ 4. Moreover, by continuity, 7 (z) < e on the closed 
ball B e (z\) for some e < <5/4. In particular, let s > 0 be such that e — s = max, gSe ( 2l ) 7 ( 2 ). Note 
that by the fact that -A 7 point wise monotonically from above, and hence locally uniformly (by Dini’s 
theorem), it follows that 7 mk {z) < e — si for all large k and 2 £ B e (zi) where 0 < si < s. Note that for 
large k it follows that G mk D B e (z\) ^ 0 and that ( - e _ 1 ^ mfc - ) 1 -i_ ) _ 1 y mfc > e — Si. Then, for all large k, there 
exists z mk £ G mk n B e {zi) such that 7 m k (z mk ) < (e _ 1 /mfc) 1 -i +1/mfc , thus, z mk £ T mk (V). Also, since 
Zm k £ B s {z\) and \z — z±\ < <5/4, it follows that | z m — z\ < <5/2. In particular, z £ Afg/ 2 (Tm k (V)) for 
large k, and we have reached the contradiction. □ 


Proof of SCI(5 1 ,f2 1 ) A < 2 . The proof is very similar to the proof SCI(S 2 , f 2 <) a < 2, however, there are 
important details that differ, so we include the whole proof. We start by defining 7 „ = Q n where Q, is as in 
Lemma [1 1.9 1 and note that 7 „ —> 7 locally uniformly. 

Step I: Defining T m and T m ,n- We start by defining, for A £ G m := (4 _m (Z + iZ)) D B m { 0), 

Sm,n{ A) := {* = m + 1 ,..., n : 7 ,(A) < —} 

m 

T m ,n{ A) := {i = to + 1 ,... ,n : 7 ,(A) < — 

TO + 1 

Now, we can define -E m ,n(A) as in (111.49t as well as T m,n{{Vp\ P eK Tm n (v)) an h r m (V) as in (111.501 ). 
where we will show that the limit in (11 1.501 ) exists. Note that, by the definition of it follows that Ar m „ {V) 
is exactly as in ( 11 1 . 211 ). 

Step II: We have that 

(11.54) r m n (}Vp}p gi Y r ^ ^(v)) > r m (V), tl y oo : 


(11.55) T m (V) — > 3i(V), ui A cxd. 

We start by showing that r mi „(V) = T m ;(y) for all large n and /, thus, the limit exist and we have 
(11 1.541) . To see this, we start by claiming that for each A £ G m then either E ra n (\) < 0 for all large n 
or E m ^ n { A) > 0 for all large n. Observe that if the claim holds this implies that r m rl (lA) = r mj z(V) for 
all large n and l. To prove the claim consider the three cases: (1) 7(A) < ^ 77 -, (2) ^ 77 - < 7(A) < 77 
and (3) 77 < 7(A). In all cases it is the locally uniform convergence 7„ —► 7 that is the key. We start 
with Case 1: In this case 7„(A) < 77777 - for large n, so | jS' Tri ., Tl (A) | = n — C\ and |T m)n (A)| = n — C 2 , 
Ci,C 2 £ N, yielding the claim. Case 2: This case has two sub-cases. If 77777 - = 7(A) then for large n, 
7„(A) < 777 so |5 ro , n (A)| = n — c for some positive integer c. Now, either |T m) „(A)| stays constant for 
large n or it grows. In either case £ , m , n ( A) = |T , m) „(A)| — c cannot change sign for large n, thus, the claim 
holds. When 77777 - < 7(A) < 777 then 777 ^ 7 - < 7„(A) for large n, so |T m>n (A)| = r, r £ N for large n. Note 
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that | S rn n (A)| — n cannot change sign for large n, thus, the claim holds. Case 3: In this case A < 7n(A) 
for large n. Thus, |S m)Tl (A)| = Ci and |r m>n (A)| = C2 for large n, yielding the claim. As a side note, we 
observe that due to reasoning in the three cases above we have that 

(11.56) r m (F)={z£G m : 7 (z)<—y—}U £, L c {z € G m ■ 7 <a) < —}. 

to + 1 to 

To show (11 1.551) . let <5 > 0 and K C C be compact, we need to show that T m (y) IT K C ■A/$(Hi(V’)) and 
E 1 (V)r\K C N'siJ'miV)) for large to. We start with the second inclusion. We assume that Si (V) D K ^ 0 
otherwise there is nothing to prove. Let to be so large that 4~ m < S and K C B m ( 0). Let z £ Si(V) D K. 
Then there is a z\ £ G m so that \zi — z | < 4~ m < S. Moreover, 

zi £ {z : dist(z, sp(-A + V)) < 4 _m } C {z : 7 (z) < j-}. 

In particular, z\ £ T m (V). Thus, since \zi — z\ < S it follows that £i(V) (T K C Afs(Xm(V)) as desired. 
As for the other inclusion, observe that in view of (11 1.561) and since Si (V) = {z : 7 (z) = 0} it suffices to 
show that we may choose an e > 0 such that {z : 7 (z) < e} (T K C Afs{{z ■ 'y(z) = 0}). Suppose not. Then 
there exists a sequence {e m }, such that e m —> 0, and a sequence { z m } such that z m £ {z : 7 (z) < e m } (T K 
but z m ^ Ms{{z : 7(2) = 0}). By possibly passing to a subsequence we may assume that z m —> zo and 
note that zo £ Afs{{z : 7 (z) = 0}). However, by continuity of 7, 7 (z m ) 7 (^ 0 ), an d 7 (z m ) 0, hence 
7(2:0) = 0 yielding the contradiction. □ 

11.2. The case of unbounded potential V. In this section we prove Theorem l4.5l on the SCI of spectra and 
pseudospectra of Schrodinger operators with unbounded potentials. Let us outline the steps of the proof first: 

a. Compactness of the resolvent: The assumptions on the potential imply that the operator H has a 
compact resolvent R{H 1 z) (see Proposition ! 1 1.201) . Therefore the spectrum is countable consisting 
of eigenvalues with finite dimensional invariant subspaces. 

b. Finite-dimensional approximations: The main part of the proof centers around showing that it is 
possible to construct, with finite amount of evaluations of V, square matrices //,, whose resolvents 
(when suitably embedded into the large space) converge to R(H , z 0 ) in norm at a suitable point 
2:0 (see Theorem II 1.221 . Note that this technique is very different from what we used and is only 
possible due to compactness. 

c. Convergence of the spectrum and pseudospectrum: We use the convergence at zq to show conver¬ 
gence at other points z in the resolvent set. 

As the argument is otherwise independent of the particular set-up, we start with a general discussion. 
In the end we demonstrate the construction of the matrices H n and the convergence of the resolvents. We 
assume the following: 

(i) Assumptions on the operator A: Given a closed densely defined operator A in a Hilbert space 
FL such that at zq £ C the resolvent operator R(z 0) = (A — zo) -1 is compact R(zq) £ K(fH). Thus 
sp(A) = { A, }, the spectrum of A, is at most countable with no finite accumulation points. 

(ii) Assumptions on the approximations A n : Suppose A n is a finite rank approximation to A such that 
if E n is the orthogonal projection onto the range of A n , then A n = A n E n . We put further 7~L n = E n H 
and denote by A n the matrix representing A n when restricted to the invariant subspace 'H n w.r.t. some 
orthonormal basis. Now, take the resolvent ( A n E n — zE n ) -1 of this restriction, extend it to Hf by zero, 
and denote this extension by R n {z). Then R n {z) = R n (z)E n , and R n {z) = (A n — z) _1 + (I — E^z -1 
for all z 0 for which the inverse exists. Finally we assume that R n {zf) exist and 

(1L57) ||f?„(2:o) - i?(2:o)|| —» 0, n —»• 00. 

Convergence of the spectrum and pseudospectrum. The first step is to conclude that if the finite rank 
approximations to the resolvent converge in operator norm at one point zq, then they also converge locally 
uniformly away from the spectrum of A. To that end denote by U r {p) the open disc at center p and radius r. 
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Proposition 11.10. Suppose R{z) and R n (z) are as above and satisfy (II 1.57b . Let K. C C be com¬ 
pact, r > 0 and define lC r = K, \ [J ? U r (Xj). Then for large enough n, R n (z) exists for all z £ IC r and 
su Pze(t r II Rn(z) — 12(2)|| —>• 0 as n —> oo. 

Proof Clearly R(z) = R(z 0 )(I - (z - zo)R(zo))- 1 and R n (z) = R n (z 0 )(I - {z - z 0 )R n (z 0 ))~ 1 for all 
z in which R(z), resp. R n (z), exist. By (II 1.571) it suffices to prove the existence of R n {z) and 

sup ||(/ - (z - z 0 )Rn(z 0 ))~ 1 -{I-(z- 2o)f2(2 0 )) _1 || 0. 

ZfE/Cr 

However, we know that (J — (z — zo)f?(^o)) _1 is meromorphic in the whole plane and hence analytic in 
the compact set JC r and in particular uniformly bounded. But this means that it is sufficient to show that the 
inverses converge, which in turn is immediate from (II 1.57b since 

sup ||(I - (z - z 0 )R n (z 0 )) — (1 — (2 — 20 ) 12 ( 2 : 0 ))II = sup \z - z 0 1 ||f?„(2o) - f?(~o)||- 

•Z(=/C r 2(E/C r 

To see that this suffices, write T n (z) = (I — (z — zf)R n (zf)), T{z) = (I — (z — zq)R(zo)) and 

T n (z) = T(z)[I + T(z)- 1 (T n (z)-T(z))]. 

Then for large enough n and 2 £ KL r by a Neumann series argument 

WTn(z)- 1 - T(z)- 1 !! < ||T( 2 )- 1 || [(1 - ||T( 2 )- 1 ||||T ri ( 2 ) - T(^)ll)- 1 - 1], 

□ 


Proposition 11 . 11 . Let K, C C be compact and <5 > 0 . Then, for all large enough n, 

sp(H)n/C c J\fs(sp(A n )), sp(H„)n/C c J\fs(sp(A)). 

Proof Since the eigenvalues are exactly the poles of the resolvents, the claim follows immediately from the 
previous proposition. □ 

The last proposition gives the convergence of the spectra. The discussion on pseudospectra is somewhat 
more involved. We need to know that the norm of the resolvent is not constant in any open set. The following 
is a theorem due to J.Globevnik, E.B.Davies and E.Shargorodsky which we formulate here as a lemma: 

Lemma 11.12 (1 421 and [ 3 l]|). Suppose A is a closed densely defined operator in 'H such that the resolvent 
R{z) = (A — z )^ 1 is compact. Let Q C C be open and connected, and assume that, for all z £ Cl, 
||i?(2)|| < M. Then, for all 2 £ Cl, ||1?(2)|| < M. This is particularly true if PL is finite dimensional. 

The theorem in ED is formulated for Banach spaces X with the extra assumption that X or its dual is 
complex strictly convex, a condition which holds for Hilbert spaces. The case TT being of finite dimension is 
already settled by 02). We put 7 ( 2 ) = 1 /|| 12 ( 2 )|| and 7 ^( 2 ) = l/|| 12 ra ( 2 )|| and summarize the properties of 
7 and 7 n as follows: 

Lemma 11 . 13 . If (i) and (ii) hold, then 7 n {z) —> 7(2) uniformly on compact sets. Neither 7, nor 7 n is 
constant in any open set and they have local minima only where they vanish. Additionally, 

(11.58) 7(2) < dist(z, sp(.A)). 

Consequently, 

sp e (H) = {2 : 7(2) < e} = cl{2 : 7(2) < e}, sp e (H n ) = {2 : 7„(z) < e} = cl{2 : 7 n (z) < e}. 

Proof. Observe first that (II 1.58b is just a reformulation of a general property of resolvents. Next, notice that 
|| 12 n ( 2 )|| = ||12(j4„, 2 )|| and that the norms of resolvents are subharmonic away from spectrum and therefore 
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7 and y n cannot have proper local minima, except when they vanish. Furthermore, they cannot be constant 
in an open set by Lemma fl 1.121 

To conclude the local uniform convergence, let M be such that along the curve {|z| = M} there are 
no eigenvalues of A and choose 1C as the set {|z| < M}. Choose any e, small enough so that the discs 
{| z — Aj|} < e/3 separate the eigenvalues inside 1C. By Proposition 1 11.1 01 we may assume that n is large 
enough so that for z € /C e / 3 (recall lC r from Proposition ! 11.1 Ob we have I 7 n (z) — 'y(z)\ < e/3. On the other 
hand, if |z — X :) < e/3 then 7 (z) < e/3 and, since has to vanish also somewhere in that disc, we have 
7 n (z) < 2e/3 in that disc, hence I 7 ra (z) — "i{z)\ < 7 „(z) + 7 (z) < e. Thus we have I 7 „(z) — 7 (z)| < efor 
all z £ /C. 

Finally, to justify the equivalence of the characterizations of pseudospectra just notice that the level sets 
{z : 'y(z) = e} and {z : 7 ™(z) = e} cannot contain open subsets or isolated points. □ 

Lemma 11.14. Assume p n and c p are continuous nonnegative functions in C which have local minima only 
when they vanish, are not constant in any open set and ip n converges to ip uniformly in compact sets. Set 
S := {z : <p(z) < 1} and S n := {z : <p n (z) < 1}. Let K, be compact and S > 0. Then the following hold 
for all large enough n 

(11.59) SnlC C JVs(S n ), S n fMC C Af s (S). 

Proof. Consider the first part of (111 .59b and assume that the left hand side is not empty. Due to compactness 
of S fl K, there are points z 4 £ S fl K, for i = 1,..., m such that S fl /C C U^i ^S/ 2 { z i). Notice that 
p{zi) < 1. If p(zi) < 1, set yi = Zi. Otherwise, p{zf) = 1, in which case Zi cannot be a local minimum, 
but since <p is not constant in any open set, there exists a point j/j £ U&ii(zi) such that pfyf) < 1. But 
since p n converges uniformly in compact sets to p we conclude that for all large enough n and all i we have 
Tn(yi) < 1- Hence z, £ A^/ 2 (5„) and so S D K. C |J™ x U S / 2 (zi) C Af s (S n ). 

Consider now the second part of ( II 1.59b . If it would not hold, there would exist a sequence {rij} and 
points z nj £ S nj fi K, such that z nj ^ A fs(S). Suppose z njk z. Then dist(z,5) > <5 as well. However, 
writing p{z) < \p(z) - <p{z njk )\ + \p(z njk ) - <p njk {z njk )\ + Vn jk (z njk ) we obtain p{z) < 1 as the first 
term on the right tends to zero because ip is continuous, the second term converge to zero as p n approximate 
p uniformly in compact sets, and p njk ( z n Jk ) < 1- Hence z £ S fl /C which is a contradiction. □ 

Combining these we can state the following result. 

Proposition 11.15. Let 1C C C be compact and <5 > 0. Then, for all large enough n, 

sp e (A) n/C c A//(sp e (A„)), sp e (A„) n/C c Af 5 (sp e (A)). 

The general algorithms. Here A, A n are operators in 'H as in (i), (ii) above, while A n is the matrix 
representing A n when restricted to the finite dimensional invariant subspace TL n = L ri Ti.. In particular 
\\Rn{z)\\ = ||(A„ — z) _1 ||. Denoting by <ti the smallest singular value of a square matrix we have 7 n (z) = 
l/\\R n (z)\\ = oi(A n — zi). Let r > 0 and define G r := B r ( 0) fl (^:(Z + iZ)). Define rjj and by 

(11.60) ri(A) = |z£G„: f 7 1 (I n -z/)<iJ, Yl(A) = {z£G n :<j 1 {A n -zI)<e}, 

which we shall prove to be the towers of algorithms for Si and S 2 (as defined in Theorem l4.5l >. respectively. 
Observe that rjj(A) and I fJA) can be executed with finite amount of arithmetic operations, if the matrices 
A n are available. 

Proposition 11.16. The algorithms satisfy the following: 

ri(A)^sp(A), r 2 (A)-+sp e (A), 


( 11 . 61 ) 


n —> 00. 
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Proof. 1. We begin with the second part of (II 1.6 11 1 It suffices to show that given 5 and a compact ball 1C, 
for large n: 

(i) sp e (A n ) n G n n K. C 7)4(sp e (A)), (ii) sp e (A) n K C Ms{sp e (A n ) n G n ). 

The first inclusion follows immediately from Proposition II 1.151 . To see (ii) we argue by contradiction and 
suppose not. Then by possibly passing to an increasing subsequence {fc ra }neN C N there is a sequence 
z n £ (sp e (A) n 1C) \ J\fg(sp e (Ak n ) D Gk n ) for all n. Since sp e (A) IT K. is a compact set, by possibly 
extracting a subsequence, we have that z n —> zq £ sp e (A) IT 1C. Consider the open ball Us/•$ (^o) which must 
contain all z„ for n sufficiently large. Since 7 (z) is continuous, positive, not constant in any open set and 
without nontrivial local minima, it follows that sp e (A) equals the closure of its interior points. In particular 
int(sp e (A)) D ( 7 , 5 / 3 ( 20 ) ^ 0. Suppose then r > 0 and yo are such that the closure of the open ball U r (yf) 
is inside this open set: B r (y 0 ) C int(sp c (A)) IT Us/ 3 {zo). We claim that sp e (A„) IT U r {yo) = U r (yo) for 
all large enough n. Indeed, since U r (yo) bounded away from the boundary of the pseudospectrum of A, we 
have 7 ( 2 ) < e — s for some s > 0 and for all 2 € U r (yf) Now the claim follows from the locally uniform 
convergence of y n . 

By the definition of G n we have that U r (yf) C Ms/ 3 (U r (yo)nG n ) for large n, so, by the claim, U r (yo ) C 
M s / 3 (sp e (A n ) IT G n ). Hence, since U r (yo) C Ug/ 3 (zo), it follows that 

z n 6 U S / 3 (zo ) C Af 2 s/ 3 {U r {y 0 )) C W 5 (sp e (H n ) n G n ), 

for large n, contradicting z n ^ 7\4(sp e (Al ri ) D G n ). 

2. To prove the first part of (111.61b we argue as follows. Given 5 > 0 and compact 1C, we need to show 
that for large n: 

(iii) sp(.A) IT K. C fifs{sp 1/n (A n ) n G n ) (iv)) Af s (sp(A)) D sp 1 /n (H„) n G n n/C. 

To show (iii), we start by defining G n := -AfL + ih) and note that for A„ £ sp(H„) we have that 
M/n({A„}) IT G n f 0 for every n. Hence, sp(A n ) C N\/ n ^sp(H„)^ nG„j. Thus, since 

Af 1/n (sp(A n )) C sp 1/n (A n ), compare dl 1.58b , it follows that sp(H„) C J\f 1/n ^sp 1/n (H n ) (T G„j. Now 
by the first part of Proposition lll.lll we have that sp(H) IT 1C C J\fs/ 2 {sp(A n )) for large n. Thus, combining 
the previous observations, we have that 

SP(T^) IT 1C C ffs/ 2 +l/n ^®Pl/ra(^n) Gn'j C J\fg ^Sp 1 / rl (H rl ) IT Gn'j , 

for large n. However, since K. is bounded we have that there exists an r > 0 such that if A £ G n IT U r (0) c 
then 7V)'({A}) D sp(H) D 1C = 0 for all n. Hence, sp(H) IT 1C C Ms ^sp 1 / n (H rl ) IT G^j as desired. 

To see (iv), let r > 0 be so large that Afs(U r (0) c ) IT 1C = 0. Note that sp e (H) —> sp(H) as e — > 0. 
Thus, sp £l (H) D B r ( 0) C 7)4/2(sp(H)) for a sufficiently small ei. Also, by the second part of Proposition 
II 1.15l it follows that sp Cl (A n ) D K C Ms / 2 (sp €l (A)) for large n. However, by the choice of r we have that 
sp £i (^n) nice 7)4/2 (sp £l (A) n B r ( 0)). Clearly, sp 1/n (A n ) IT K C sp ei (A„) IT K for large n. Thus, by 
patching the above inclusions together we get that 

sPi /n (A„) n/C C sp ei (A n ) n/C C 7V4/ 2 (sp ei (A) fl5 r (0)) c7V4(sp(A)), 

for large n, as desired. This finishes the proof of Proposition ! 1 1.161 □ 

Next, we pass from these general considerations to the Schrodinger case. 

Compactness of the resolvent. We first show that the resolvent of the Schrodinger operator II is compact. 
To prove this we recall some well known lemmas and definitions from 133. 

Definition 11.17. An operator A on the Hilbert space H is accretive if the Re( Ax, x) > 0 for x £ 'D(.A). It 
is called m-accretive if there exists no proper accretive extension. If A (possibly after shifting with a scalar) 
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is m-accretive and additionally there exists (3 < 7r/2 such that \ sxg(Ax, x)| < ft for all x £ 13(A), then A 
is m-sectorial. 

Lemma 11.18 ( Il57l YI-Theorem 3.3]). Let A be m-sectorial with B = Re A. A has compact resolvent if 
and only if B has. 

Lemma 11.19 ( (57J V-Theorem 3.2]). IfT is closed and the complement o/Num(T) is connected, then 
for every C, in the complement of the closure o/Num(T) the following hold: the kernel ofT — C, is trivial and 
the range ofT — (, is closed with constant codimension. 

Proposition 11.20. Suppose V is continuous R d —> C satisfying the following: V(x) = \ V(x)\e t<p ^ such 
that \V(x)\ —> oo as x —> oo , and there exist nonnegative 9\, 62 such that 9\ + 62 < 7r and —62 < p(x) < 
6 \. Denote by h the operator h = —A + V with domain D(h) = C^°(M d ) and put in L 2 (M . d ) H = h**. 
Then H = — A + V is a densely defined operator with a compact resolvent. 

Proof. The proof goes as follows: Notice first that the numerical range of H lies in a sector with opening 
2/3 < 7 r. Then we turn the sector into the symmetric position around the positive real axis to get the 
operator a(a). It is clearly enough to show that A(a) = a(a)** is an m-sectorial operator with half-angle 
@ = (Oi + 6> 2 )/2 which has a compact resolvent. Next, since the numerical range of a(a) is not the whole 
plane, the operator is closable. Then we conclude that every point away from the numerical range belongs 
to the resolvent set. This is done based on the fact that the adjoint shares the same key properties as A(a). 
Then the compactness of the resolvent follows by considering the resolvent of the real part of A(a). 

Here is the notation. Put a = (9i — 82 )/2 so that |a| < 7r/2. Then with 

(11.62) 9(x) = ip(x) — a 

we have a(a) := e~ la h = —e _M A + \V(xfe 1 ^^ and after extending A(a) = a(a)** , in particular 
H(a) := ReA(a) = —cosa A + cost?(x)|V(j:)|. 

We claim that the operator A(a) := e~ la H is m-sectorial with half-angle /3 = (9\ + 82 )/2. Indeed, it is 
immediate that the numerical range satisfies the following Num(a(a)) C {z = re 1 s : \9\ < /3, r > 0 }, 
which is not the whole complex plane, and we can therefore (by El V-Theorem 3.4 on p. 268]) consider the 
extended closed operator A(a) instead. The next thing is to conclude that points away from this closed sector 
are in the resolvent set of A(a). Take any point (, = re llp with /3 < \<p\ < 7r, r > 0. We need to conclude 
that ( ^ sp(A(a)). Since the complement of Num(A(a)) is connected, the following holds (by Lemma 
111.19b : the operator A(a) — ( has closed range with constant codimension. Thus, we need that the range is 
the whole space. Put for that purpose T = A(a) — (. Suppose there is g 0 such that g £ Ran(T) ± . Then 
for all / £ V(T) we have (Tf, g) = 0 which means, as V(T) is dense, that T*g = 0. But that is not the 
case as A(a)* — ( is also closed whose complement of the numerical range is connected and hence does not 
have a nontrivial kernel. 

The proof of Proposition II 1.20l can now be completed by invoking Lemma 11 1. 1 81 since it is well known 
(ED, Theorem XIII.67) that (since a < 7r/2) the self-adjoint operator H(a) has compact resolvent when 
the potential |R(:r)| tends to infinity with x. □ 

We shall next consider the discretisation of H and of A(a). It shall be clear that the discrete versions have 
their numerical ranges inside the same sectors, where the numerical range of an operator T is denoted by 
Num(T). Thus all resolvents can be estimated using the fact that if (T — £) -1 is regular outside the closure 
of Num(T), then ||(T — C) _1 || < l/dist(C, Num(T)). 

Discretizing the Schrodinger operator. We shall show how to assemble the matrices H„ mentioned 
above. The underlying Hilbert space is again lf(W L ) and we start with approximating the Laplacian. Let 
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1 < j < d, t £ R and define Uj, t to be the one-parameter unitary group of translations 

Uj >t 1 p(x i, . ..,Xd) = 1 p(x i, ...,Xj-t,...,X d ) 

and let Pj be the infinitesimal generator of U ht so that U Jd = e ltPj and Pj = lim^o h(Uj,t — !)■ Thus, 
defining $ n {x) = — 1) with n £ N and x £ R, it follows that 

(11.63) | ^ n \ 2 (Pj)^(x) = n 2 (-ip(x i, ...,Xj + 1/n, ...x d )- ip{x i,. ,.,Xj- l/n, ...x d ) + 2 ijj(x)) 

is the discretized Laplacian in the j direction. The full discretized Laplacian is therefore 

Now we replace V by an appropriate approximation. Consider the lattice (jZ) d as a subset of M' / and for 

y € (-Z) d define the box 


(11.64) 


1 


1 


V ’-^- VI + 2n 


1 < j < d I. 


Qn{y) = | a: = (xi, ...,x d ):Xj € 

Let S n = [— Lv^J, lV™\] d C R d and define E n to be the orthogonal projection onto the subspace 

(11.65) < ip £ L 2 (R d ) : ip = ^ otyX Qn ( y ), a y £ C > , 

{ y€(^) d ns n ) 

where XQ n (y) denotes the characteristic function on Q n {y)- Define the approximate potential as 

V(y) x £ Q n (y) n S„ for some y £ (^) d , 

0 otherwise. 


V n (x) = 


Note that V n = E n V n E n , but that, in general, V n f E n VE n . Finally, we define the approximate Schrodinger 
operator H n : L 2 (R d ) —> L 2 ( R d ) defined as 


( 11 . 66 ) 


H n = E n Y J \$n\ 2 (.P J )E n + V n . 

3 =1 


Remark 11.21. Note that the restriction // n |Ran(/•;„) of H n to the image of E n has a matrix representation 
H n £ C mxm (where m = dim(Ran(£ , ra ))) defined as follows. First, for yi,y% £ (^Z) d FI S n , 


<l$n| 2 {Pj)Enn d/ 2 XQ n { Vl ),n d/ 2 XQ n (y 2 )) = < 


2 n 2 


y i = V2 

2/1 — 2/2 = ±1 /nej 


0 otherwise 


and (V n n d / 2 XQ n ( yi ),n d / 2 XQ n (y 2 )) = V(,Vi) when y\ = yi and zero otherwise. Thus, we can form the 
matrix representation of H n \Ran(E n ) with respect to the orthonormal basis {n d ^ 2 XQ„(y)} y e(-i) d r\S n - ^ * s 
important to note that calculating the matrix elements of H n requires knowledge only of {Vf }/c=A„ where 
we have A n := {f y : y £ ( n~ 1 Z) d flS n } and V fy = f y (V) = V{y). 


We have so far shown that the Assumption (i) holds, and we are left to show that the discretization we 
have chosen satisfies Assumption (ii). In particular, we need to demonstrate that our discretization satisfies 
(111.571) . That is the topic of the following theorem. 

Theorem 11.22. Let V £ C(R d ) be sectorial as defined in (14.51) satisfying |R(x)| — > oo as \x\ —> oo, and 
let h = — A + V with V(h) = C£° (R d ) and let H = h**. Let H n be as in (111.661 ). Then there exists Zq such 
that ||(IT — z 0 ) -1 — (H n — Zo)~ 1 E n \\ —> 0, as n —> oo. 

Remark 11.23 (Proof of Theorem 14.51 ). Note that we immediately have 

Theorem ll 1.221 + Proposition ! 1 1.161 => Theorem l4.5l 
Thus, the rest of the section is devoted to prove Theorem ll 1.221 
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We shall treat the discretizations in a similar way as the continuous case, namely by ’’rotating” the operator 
into symmetric position with respect to the real axis and then, by taking the real part, we are dealing with a 
sequence of self-adjoint invertible operators. Before we prove this theorem we will need a couple of lemmas. 
We recall the following definition. 

Definition 11.24 (Collectively compact). AsetT C B('H) is called collectively compact if the set {Tx : 
T € T, ||x|| < 1} has compact closure. 

Lemma 11.25. Let { K n \ be a collectively compact operator sequence and K* —> 0 strongly. Then 

\\K n \\ ->■ 0 . 

Proof. It is well known that on any compact set B the strong convergence K* —> 0 turns into norm conver¬ 
gence: sup{||Ff*:r|| : x £ B} —> n 0. Since B := c\{K n x : ||x|| < 1, n £ N} is compact, we get 

\\ K n\\ 2 = \\K*K n \\ = sup{||/\*A'„a;|| : ||a;|| < 1} < sup{||AT*y|| : y 6 B} 0 as n-t oo. 


□ 

We also need a modification of Lemma fl 1.181 

Lemma 11.26. Let {A n } be m-sectorial with common semi-angle (3 < 7t/2 and denote B n = Re A n . 
Assume that {£„} is a sequence of orthogonal projections, converging strongly to identity and such that 
A n E n = E n A n E n and B n E n = E n B n E n . Assume further that {Bf 1 } is uniformly bounded. If{Bf 1 E n } 
is collectively compact, then so is {A^En}. 

Proof. Denote by B\J~ the unique self-adjoint non-negative square root of B n . By ll57l VI-Theorem 3.2 on 
p.337] for each A n there exists a bounded symmetric operator C n satisfying ||CVi|| < tan(/3) and such that 
An = Bn /2 ( 1 + i C n )Bn /2 . Writing 

pOO 

Af 1 = jf e~ tAn dt 

we conclude that EnAf^En = Af 1 E n and likewise for Bf 1 . Assume now that { Bf 1 E n } is collectively 
compact. But then so is {( B n + t)~ 1 E n } = {Bf 1 E n (I + tBf 1 )~ 1 E n } and writing, compare Il57l V 
(3.43) on p.282], 

i r°° 

B~V*E n = - / f _ 1 / 2 (f?„ + t)- l E n dt 

11 Jo 

we see that {Bf 1 ^ 2 E n } is also collectively compact and Bf [ ' 2 E n = E n Bf 1 ^ 2 E n . Finally {A^En} is 
then collectively compact as well since Af x E n is of the form Bf 1 ^ 2 E n T n with T n uniformly bounded. □ 

Proof of Theorem ] 1 1.22\ Note that it is clear from the definition of //„ and the assumption on V that 
Nuni(//,,) C { re' r ‘ : —62 < p < 0\.r > 0} for all n. Thus, since If, is bounded and by Proposi¬ 
tion II 1.201 we can choose any point zq £ C such that zq has a positive distance d to the closed sector 
{re lp : —62 < p < 0±,r > 0}, and both R(Lf, z 0 ) = {H — Zo ) -1 and R(H n , zo) = {H n — zf ) -1 for every 
n will exist. Moreover, R(H n , zo) are uniformly bounded for all n, since for every x, ||x|| = 1, 

II (H n - z 0 )x\\ > | ({H n - z 0 )x,x)\ > \(H n x,x) - z 0 \ > d. 

Note that by Lemma fl 1.251 it suffices to show that (i) R(Lf n , zo)*E n —> R(H,zq)* strongly, and (ii) 
{R(H n , zo)E n — R(H, zq )} is collectively compact. 

To see (i) observe that C'£°(R rf ) is a common core for H and for //„. Hence by ll57l VIII-Theorem 
1.5 on p.429], the strong resolvent convergence 11(11,, . zq)* —> IKII. zq)* will follow if we show that 
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H*ip -A H*ip as n —> oo for any ip £ Then the strong convergence R(H n , zo)*E n —> R(H , z 0 )* 

follows as well. Note that 


(11.67) 


\\H* n iP-H*iP\\< 


d d 


3 = 1 


3=1 


+ \\(Yn 


V)ip\\. 


Also, |$ n | 2 (Pj) = n(r_ \/ ne — /)n(Ti/„ e . — /), where r z ip{x) = ip(x — z) and {e^} is the canonical basis 
forR d . Moreover, for ip £ Cp°(R d ), 


E n 1 p= ^ (*n*1p)(y)XQ n (y), 5 ’n = Pn® ■■■&Pn, Pn = n X {^, i), 

y€(±Z.) d nS n 

where S n was defined in (11 1.65b . Thus, it follows from easy calculus manipulations and basic properties 
of convolution that | $ n \ 2 (Pj)E n ip = Ey6(iz)‘ i ( ^, ™ * Pi *o P2 *o V’")(2/)XQ„( y )> where Pi = «X[-i/n,o]. 
f >2 = nx[ 0 ,i/„i and *j denotes the convolution operation in the jth variable. By standard properties of the 
convolution we have that fyn * pi *j p 2 *j ip” ip” uniformly as n —> oo. Thus, since ip £ C£°(M d ), 
the first part of the right hand side of (11 1 .67b tends to zero as n —> oo. Due to the continuity of V and the 
bounded support of ip it also follows easily that \\{V n — V)ip\\ —> 0 as n —> oo. 

To see (ii) we use the same trick as in the proof of Proposition II 1.201 In particular, first set zq = —e la 
(which is clearly in the resolvent set of H n for a = (Pi — (? 2 )/ 2 ) then let A n (a) = e~ la (H n — Zq) and further 
H n (a) = Re A n (a). Note that, by Lemma fl 1.261 we would be done if we could show that {7T„(a) -1 } is 
uniformly bounded and {H n (a)~ 1 E n } is collectively compact as that would yield collective compactness 
of { A n (a)~ 1 E n } and hence of {R(H ni zo)E n }. To establish collective compactness, note that 

d 

(11.68) H n {a) = cosaE n ^2 \®n\ 2 {Pj)E n + cost?(x)|14(x)| + 1, 

3 =i 

where d is defined in (11 1.62k Thus ||7T n (a) _ 1 1| < 1 and by applying Lemma fl 1.271 we are now done. □ 


Lemma 11.27. Let H n (a) be given by A11.68\) . Then the set {H n (a) 1 E n } is collectively compact. 


Proof. We shall show that if we choose an arbitrary sequence {ip n } C L 2 (S. d ) satisfying ||'0n|| < 1, then the 
sequence {pn} where ip n = H n (a)~ l E n ip n , is relatively compact in L 2 (M. d ). The compactness argument 
is based on the Rellich’s criterion. 


Lemma 11.28 (Rellich’s criterion ( lf78l Theorem XIII.65)). Let F[x) and G(ui) be two measurable 
nonnegative functions becoming larger than any constant for all large enough \x\ and |w|. Then 

S = {ip : J \p(x)\ 2 dx < 1 , J F(x)\ip(x)\ 2 dx < 1 , J G(w)| Fp(uj)\ 2 duj < 1 } 

is a compact subset ofL 2 ( R d ). 


To prove Lemma fl 1.27l we proceed as follows. First we conclude that { ip n } is a bounded sequence itself. 
Then, in order to be able to define suitable functions F, G we need to approximate the sequence by another 
one of the form T* tp n . This approximation shall satisfy linin^oo ||'J' n * ip n — p n \\ =0 and this is very 
similar to the standard result on local uniform convergence of mollifications of continuous functions. Then 
the Rellich’s criterion holds for * tp n with F[x ) essentially given by |V(x)| and G(ui) by |cc| 2 . We then 
conclude that the sequence {Tin * tp n } is relatively compact. But since linin^oo ||’J' n * p n — tp n || = 0, the 
sequence {</?„} is relatively compact as well, completing the argument. 

More precisely, since |$(x)| < a < n /2 we have from (11 1 . 68 b 


(y 'j l$n| (Pj) l Prii Tn) + (l^nl^ru Pn) 

3 =1 


(11.69) 


\{H n (a)(p n ,<Pn)\ > cosa 


+ M 2 - 
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But | (H n (a)ip n , (p n )| is bounded not only from below but also from above. Indeed, \(H n (a)ip n , ip n )\ = 
\(E n i/j n , ip n )\ < \\H n (a)~ 1 E n \\ 11V’rj,11 2 • Thus, we conclude first from (111.691 ) that the sequence {f™} is 
bounded. Next, in view of (11 1.69l >. there exist constants C'-|, C -2 > 0 such that for all n G N 

d 

(11.70a) <£>"l 2 (^>n,^n)<C 1 , 

1=1 


(11.70b) 


(|Tn|Fn; Fn) ^ C 2 . 


First we use the bound (11 1.70al >. Letting T denote the Fourier transform, we have that (F<f> n (Pj)tp n )(ijj) = 
for a.e. uj and for 1 < 3 < d. Letting 0 n (w) = sm ^/ 2 n”' > ’ an application of the Fourier 
transform to (111.70al) along with Plancherel’s theorem yield 

f \(E<p n )(u )\ 2 V \ujjQ n (ujj )\ 2 duj < Ci. 

J ^ d i ti<d 

Moreover, since |0 n (w)| < 1 for all ui, we get 

(11.71) f |w| 2 |0„(wi) • • • 0 n (w d )| 2 |(J r (^ n )(w )| 2 du: < Ci. 

J R rf 


We now define the approximation ^f n *(p n . Let T , i(x) = X[-i/2,i/2] d (~) and further 'F„(z) = n^i^nz), 
where \a{z) is the usual characteristic function for the set A. We shall prove below that lim-n^oo ||\H„ * 
ip n — <Pn || = 0, which in particular shows that the sequence {(!/„ * is bounded. Observe then that 
(J 7 T' rl )(u;) = 0„(wi) • • • Q n (u}d)- Therefore we obtain from (11 1.711) 


|u>| 2 |.F(^ n * tp n ){u>)\ 2 duj < Ci, 


which shows that we can choose G(uj) to be (a constant times) |w| 2 . 

We still need to establish the growth function F(x) for 'f „ * ip n . Consider tp n . It is of the form <p n = 
(E n + EnBnEn )- 1 E n ip n and hence E n <p n = ip n . Therefore tp n vanishes outside S n and we can essentially 
replace V n by V in the inequality (11 1.70bl) . To that end, put F{x) = min| !/ |>| a .| |V’(j/)|. Then with some 
constant C 3 

(11.72) f F(x)\(^n* ip n )(x)\ 2 dx <C 3 . 

J R<* 


In view of the bounds (111.71l >. (11 1 .72b and since the sequence {* Fn}neN is bounded in L 2 , Rel- 
lich’s criterion implies that {iPn * <p n }ne n is a relatively compact sequence and it therefore follows that 
{^nj-neN is relatively compact, thus finishing the proof. Hence, our only remaining obligation is to show 
that liniyj^oo H'l'n * ip n ~ ip n || = 0. This result is very similar to the standard result on local uniform 
convergence of mollifications of continuous functions. 

Let z e R. d and define the shift operator r~ on L 2 (R d ) by r z f(x) = /(x — z). Now observe that by 
Minkowski’s inequality for integrals it follows that 


(11.73) 

||^n * Fn ‘Frill til 


/ 


I Tl z (fi n - ^ n |||^i(z)| dz = 


|e^...e^ Pl 


[-l/ 2 ,l/ 2 } d 


<Pn - Fn|l dz. 


The claim follows from an e/d argument and (11 1 .73b combined with the dominated convergence theorem 
(recall that { ip n } is bounded): we need to show that for fixed z £ [— 1 / 2 , 1 /2] d and for any 1 < j < d. 


(11.74) lim 

n—too 


J — Pi 


... e 


iilPi 


Fn - e 


p, 

... e " 1 Fr 


= 0, lim 

n—too 




Fr 


= 0 . 


Since e* » Pj e l « Pk = e’ « Pk e l « Pj and ||e l » Pj ■ ■ ■ e 1 » Pl || < 1 for 1 < j,k < d, (11 1. 741 will follow 
if we can show that \\(e l ~ Pj — I)ip n || —>■ 0 as n —»■ oo. Note that, by the choice of the projections E n , 
it follows that for 1 < j < d, \((e l ^ Pj — I)tp n )(x)\ < |((e*»- p j — I)ip n )( x)\, for 0 < Zj < 1/2 and 
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x £ Also, |((e* " Pj — I)ip n )(x) | < |((e — /)<^ n )(a;)| for —1/2 < < 0. However the bound 

J 2 i<j<d \\®n(Pj)Vn\\ 2 < Cl implies that lim^oo ||(e ±l « p ^ - I)Pn)\\ = 0, which proves the claim. 

□ 


12. Proofs of theorems in section[5] 


Proof of Theorem 1372] Step I: Clearly, SCI(3, Hi)a > SCI(S, Hi)g > SCI(S, H 2 )g> and SCI(s, Hi)a > 
SCI(S, fi 2 )a > SCI(3, H 2 )g- We start by showing that SCI(3, H 2 )g > 2. For n, m £ N \ {1} let 


(l/r 


Bn,m • — 


V 1 


1 \ 


1 /m) 


£ <C" 


and for a sequence {l n }n&N C N \ {1} set 


A Pz„,n+ 1 . 

n— 1 

Clearly, A defines an invertible operator on / 2 (N). Furthermore, we define fr = {bj} £ / 2 (N) such that 

{ 1 n 

— 2 t=X>+i, ««. 

0 otherwise. 


Let also C m := diag{l/m, 1,1,...} and note that its inverse is given by diag{m, 1,1,...}. We argue by 
contradiction and suppose that there is a General tower of algorithms I’ r , of height one such that I ( A. b ) —> 
S(A, b) as n —> oo for arbitrary A and b. For every A, b and k £ N let N(A, b, k) denote the smallest 
integer such that the evaluations from Ar fc (A, b) only take the matrix entries A,j = (Aej, e/) with i,j < 
N ( A , b, k) and the entries bi with i < N ( A , fr, k) into account. To obtain a particular counterexample (A, fr) 
we construct sequences {Z„} ne N an d {fc„} ne z + inductively such that A and fr are given by {£„} as above but 
r fen (A, fr) -f* 3(A, fr). As a start, set ko = lo '■= 1- The sequence (x ) 1 := (C' 2 ) _ 1 f : ifr has a 1 at its first 

entry and since, by assumption, —> 3, there is a A: i such that, for all k > ki , the first entry of T*, (C 2 , Pi fr) 
is closer to 1 than 1/2. Then, choose 1 1 > 7V(C 2 , Pi&, ki) — Iq. Now, for n > 1, suppose that lo,..., l n -i 
and ko,..., k n -i are already chosen. Set s n := h- Then also P Sn b is already determined and 

x/C = 1, where {a^ n) } jeN := (B ha ® B hi3 © ... © Bi n _ ljU © C n+ i)~ 1 P Sn b. 

Since, by assumption, r& —> 3, there is a k n such that for all k > k n 

\ x (n,k ] _ J| < i/ 2 , where {®$ n ’ fc) } j{ : N := T k(B ll>2 © B ^ 3 © ... © B ln _^ n © C n+u P Sn b). 

Now, choose l n > N(B hy2 © -B ; 2 ,3 © ... © © C n +\, P Sn b, k n ) - l 0 - 1 1 - ... - l n - 1 - By this 

construction we get for the resulting A and & that for every n 


r fc „(A, fr) = r fcn (H ;ij2 © B h , 3 © ... © B ltl _ 1>n © C n+ i, P Sn b). 

In particular Hindoo Tfe(A, fr) does not exist in l 2 ( N), a contradiction. 

Step II: To show that SCI(S, Hi )a < 2, let A be invertible and Ax = b with the unknown x. Since 
P m are compact projections converging strongly to the identity, we get that the ranks rk P m = rk( AP m ) = 
rk (P n AP m ) for every m and all n>n 3 with an n 3 depending on m and A. Then, obviously, P m A*P n AP m 
is an invertible operator on Ran(P m ), and we can define 

^ ( a u\ / {0}jgn if <xi{P m A*P n AP m ) < A 

I (P rn A*P n AP m )~ 1 P m A*P n b otherwise. 
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Note that for every A, b, to, n in view of Proposition 110.11 and any standard algorithm for finite dimensional 
linear problems, these approximate solutions can be computed by finitely many arithmetic operations on 
finitely many entries of A and b, hence L mn are general algorithms in the sense of Definition l2.3l and require 
only a finite number of arithmetic operations. Moreover, they converge to y m := (P m A*AP m )~ 1 P rn A*b as 
n —> oo. It is well known that y m is also a (least squares) solution of the optimization problem \\AP m y — 
&|| —> min, that is 

II AP m y m - 6|| < \\AP m x - &|| < ||A||||P m a; - A~ x b\\ = ||A||||P m ic - tr|| 0 

as to —> oo. Therefore ||y m — x|| = \\P m Um — x|| is not greater than 

\\A~ 1 \\\\A(P m y m - x)|| = \\A~ 1 \\\\AP m y m - 6|| < ||A _1 ||||A||||P m x - x|| 0, 

which yields the convergence y m —> x and finishes the proof of Step II. 

Step III: Let / be a bound on the dispersion of A. The smallest singular values of the operators AP m 
are uniformly bounded below by || A _1 || _1 which, together with ||Pj( m ) AP m — AP m || —> 0, yields that the 
limes inferior of the smallest singular values of Pj( m ) AP m is positive, hence the inverses of the operators 
B m := P m A* AP m and C m := P m A*Pfi m -)AP m on the range of P m exist for sufficiently large m and have 
uniformly bounded norm. Moreover, HP ” 1 — C'“ 1 || < H-B^HHCTi — ^m|ll|Cm 1 |l tend to zero as m —> oo. 

This particularly implies that the norms \\y m — (P m A*Pf^AP rn )~ 1 P m A*b\\ with y m as above tend to 
zero as m —> oo, and we easily conclude that the norms \\y m — T m j( m ) ( A , b) || tend to zero as well. With the 
convergence \\y m — 6 c|| —0 from the previous proof, now also ||ai — Y m j( m ) (-4, 6 ) || —>• 0 holds as m —> oo, 
which is the assertion SCI(S, YIs)a < 1. Clearly SCI(B, fl 3 )G > 1. □ 

Remark 12.1. The technique used with uneven sections to obtain the bound SCI(5, J2 i)a < 2 is also 
referred to as asymptotic Moore-Penrose inversion as well as modified (or non-symmetric) finite section 
method in the literature, although written in a different form, and is widely used (see e.g. SaEUElMlED). 
Also the idea to exploit bounds on the off diagonal decay is considered e.g. in li44l or in the theory of 
band-dominated operators and operators of Wiener type (cf. Il63ll771l83l f. 

Proof of Theorem [5. A We proceed in a similar way as in the previous proofs. Let {(„} n gN be some sequence 
of integers l n > 2. Define 

A 

0 

eC nx ". 

o 

i/ 

Clearly, such A are invertible and their inverses have norm one. Suppose that {IT} is a height-one General 
tower of algorithms which in its fcth step only reads information contained in the first N(A. k) x N(A. k) 
entries of the input A. In order to find a counterexample we again construct an appropriate sequence {(„} C 
N\{1} by induction: For C := diag{l, 0,0, 0,...} one obviously has || (C — P) — 1 1| = oo. As a start, choose 
k 0 := 1 and l\ > N(C — I, ko). Now, suppose that are already chosen. Then the operator given 

by the matrix P ; 1 ® ... © Bi n © C — I is not invertible, hence there exists a k n such that, for every k > k n , 

r k {B h ® ...®B ln ®C-I) > 2. 

Now finish the construction by choosing l n+ i > N(Bi 1 © ... © Bi n ®C — I, k n ) — l± — I 2 — ■ ■ ■ — l n - 
So, we see that 


A := 0 B ln - I, B n := 


T kn (A) = r kn {B h © ... © B ln © C - I) ^ ||A" 1 1| = 1, n -A 00, 
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a contradiction. Thus SCI(S, f> i) g > 2. In order to prove the equalities over Cl | and Cl 2 we introduce the 
numbers 


1~\\A x || 1 = min{c 7 i(Al),f 7 i(A*)} 

7 m ■= mm{o 1 (AP m ),cr 1 (A*P rn )} 

7 m,n ■— min{cr-i (P n AP rn ^j, (T\ (P n A 

5 m ,n '■= min {k/m : k G N ,k/m > oi(P n AP m ) or k/m > a\{P n A*P m )} 

and note that im 7 , and 7 m ,n t n 7 m for every fixed to. Moreover, {5 m , n } n is bounded and monotone, 
and 7m, n < b m ,n < 7 m ,n + 1/m. Thus, {5 m , n } n converges for every to, and for e > 0 there is an Too, and 
for every to > too there is an no = no(m) such that 

(12.1) |7 ^ 17 7m | T" |7m 7m, n\ T" |7m,n ^m,n| ^ 6/3 e/3 T~ 1/TO ^ 6 

whenever to > mo and n > no(m). Since (5 m ,n and hence T m , n (A) := 5~ l n can again be computed with 
finitely many arithmetic operations by Proposition 1 10. II this provides an Arithmetic tower of algorithms of 
height two, hence easily completes the proof for fli and O a . On If- we apply (112. Il l with n = f(m ) and 
straightforwardly check that r m (A) := , does the job. □ 


13. Proofs of theorems in section[6] 


Proof of Theorems \6. 1\ \6.2\ and \6.4\ We start with Theorem l6.2l Suppose that there exists a sequence {T n } 
of general algorithm f „ : such that for any A G fl, d(T p p (A)i(^): S(A)) < Define, 

for n G N, the mapping T n : O — > M. by 

f ra (A) = rf n (A) fc ,...,f„(A)i(^)> 

and Ap (A) = Ar f {A) f (A) (A)UAp (A). If we can show that T n is a general algorithm the proof is 
finished, as that would, via (113. Il l, imply that there is a height one tower of algorithms for the computational 
problem {5, f2, A4, A}, and that violates the assumption that SCI(S, f2, A4 , A)g > 2. 

To prove that I n is a general algorithm, we first note that (i) and (ii) in Definition 12.31 are immediately 
satisfied, and thus we only concentrate on showing (iii). Suppose that B G fl and /(A) = f(B) for all 
f G Ap (A). We claim that Ap ( B ) = Ap (A) and observe that this would imply (iii). Indeed, the 
assumption implies that /(A) = f(B) for all f G Ap (A) so, by (iii) in Definition 12.31 we have that 
Ap ( B ) = Ap (A), moreover, it follows that T„(A) = T n (B). Now, again by using (iii) of Definition 

E3 since for all / G ^r fnWk . fn(A)l (A) we have /(A) = f(B), it follows that Ar fn( ^( B ) = 

A p f (A) f (A) (A). Thus, since T ra (A) = T n (B) this implies the claim and we are done. 

Note that the proof of Theorem 16.41 is almost identical, thus we omit the details. For Theorem 16. II iust 
notice that indeed, if such integers nk{m ),..., ni(ro) exist then r m (A) = r nfc ( m ),..., rai ( m )(A) would define 
a General tower of height one. 

□ 


14. Proofs of theorems in Section[7] 

Proof of Theorem \7.1\ Step I: The assertion for Si is easy, so we start by discussing H 2 . To see that 
SCI(S 2 , !2i)g > 1, we argue by contradiction and assume that there is a General tower of algorithms 
{I V}npN °f height one which can answer this question. Then there exist strictly increasing sequences 
{*/c}fceN C N such that, for the sequence {ai}j £ N which has Is exactly at the positions ik, the 
algorithms T„ fc , applied to {ai}j 6 N, answer r rafc ({ai}) = No for all rife. This is proved by induction: Set 
(a/}j g N := {0, 0,...}. Then there is an n\ such that T n ({a^}) = No for all n > n\. Further, the set of 
evaluations Ar ({a/}) of l’ n| is finite, i.e. in simple words l’ n| only looks at a finite number of entries, 
lets say entries with index less than i\. Next, assume that ik, are already chosen for k = 1,..., to. Let 
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{«r +1 } i£ N denote the sequence which has entry 1 exactly at the positions and zeros everywhere 

else. Then, by the assumption that T rl ({a" l+1 }) —> 3 2 ({a™ +1 }) = No as n —> oo, there is an n m + 1 greater 
than n m such that T„({a™ +1 }) = No for all n > n m + 1. Since r „ m+1 evaluates only finitely many entries 
of {a’ Tl+ 1 }j 6 N there exists an i m . j_i > i m such that r „ m+1 only looks at the positions less than i m+ 1 of 

{«r +1 i- 

Now consider the sequence {a^ieN which has entry 1 exactly at the positions ik, k £ N. Then we 
observe that T nk ({a,}) = 1 \ lk ({ af }) = No for every k, taking (iii) in Definition 12. 3 1 into account, hence 
limfc r nk ({ ai }) = No ^ 5 2 ({ai}), a contradiction. 

To see that SCI(S 2 ,f2i)A < 2 define r mjn ({ a i}*eN) = Yes when i a * > m - Here Ar m n ({ai}) 
consists of the evaluations fj : {a^ieN H t aj with j = 1,... ,n. Obviously, these mappings T min are 
General algorithms in the sense of Definition [23] If we define I\ n ({ a,}) = Yes when > m ’ then 

we can observe that r m) „({ai}) —> r m ({ai}) as n —> oo and T m ({ai}) —> 5 2 ({ai}) as m —> oo. Thus, 
SCI(S 2 ,0 1 ) G < SCI(S 2 ,0 1 ) A < 2, and we have shown the assertions for S 2 . 

Step II: Note that it is easy to translate E;_s and 34 into the previous problems: just take an enumeration 
of the elements of N 2 , that is a bijection N —> N 2 , k ( i{k),j (fc)) in order to regard {aij} as the sequence 
{o-i(k).j(k)}k, which yields that 33 ( 34 ) is equivalent to Si (S 2 , respectively). Hence, SCI(53,0 2 )g = 
SCI(S 3 ,0 2 ) A = 1 and SCI(S 4 ,0 2 ) G = SCI(S 4 ,0 2 ) A = 2. □ 

Before we continue with the proofs we need to introduce some helpful background. Equip the set Di of 
all sequences {:Ci}i g N C {0,1} with the following metric: 

(14.1) d B ({xi},{yi}) := ^ 3 _n |x n - y n \. 

n£ N 

The resulting metric space is known as the Cantor space. By the usual enumeration of the elements of N 2 this 
metric translates to a metric on the set fl 2 of all matrices A = {aij}i with entries in {0,1}. Similarly, 
we do this for the set H 3 of all matrices A = {di,j}i,jez with entries in {0,1}. In each case this gives a 
complete metric space, hence a so called Baire space, i.e. it is of second category (in itself). To make this 
precise we recall the following definitions: 

Definition 14.1 (Meager set). A set S Cd in a metric space f l is nowhere dense if every open set U C H 
has an open subset V C U such that V fl S = 0, i.e. if the interior of the closure of S is empty. A set S C fl 
is meager (or of first category) if it is an at most countable union of nowhere dense sets. Otherwise S is 
nonmeager (or of second category). 

Notice that every subset of a meager set is meager, as is every countable union of meager sets. By the 
Baire category theorem, every (nonempty) complete metric space is nonmeager. 

Definition 14.2 (Initial segment). We call a finite matrix a £ C nx m an initial segment for an infinite 
matrix A £ fl 2 and say that A is an extension of a if a is in the upper left corner of A. In particular, 
a = P n AP m for some n, m £ N, where we, with slight abuse of notation, consider P n AP m £ C nxm . P n is 
as usual the projection onto span {e_j}” =1 , where {ejjjgN is the canonical basis for Z 2 (N). 

Similarly, a finite matrix a £ c( 2n + 1 ) x ( 2m + 1 ) i s an initial segment for an infinite matrix B £ Q 3 if a is 
in the center of B i.e. a = P n BP m where P n is the projection onto span {e :) }"__ n , where {ej}jgz is the 
canonical basis for / 2 (Z). We denote that A is an extension of a by a C A, and the set of all extensions of a 
by E(a). 

The notion of extension extends in an obvious way to finite matrices. 

Notice that the set E(a) of all extensions of a is a nonempty open and closed neighborhood for every 
extension of cr. 
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Lemma 14.3. Let {r n } n6 N be a sequence of General algorithms mapping if —> Ml, T C fL he a 
nonempty closed set, and S C T be a nonmeager set (in T) such that £ = lim^-nx, T n (A) exists and is the 
same for all A £ S. Then there exists an initial segment a and a number no such that E T (o) := T D E(cr) 
is not empty, and such that T n (A) = £/or all A £ E T (a) and all n > no- The same statement is true if we 
consider LI 3 instead of LI 2 . 

Proof We are in a complete metric space T. Since S = (JfceN &k with Sk := {A £ S : T n (A) = £ Vn > k} 
and S is nonmeager, not all of the Sr can be meager, hence there is a nonmeager Sk, and we set no := k. 
Now, let A be in the closure S no , i.e. there is a sequence {Aj} C S no converging to A. Note that by 
assumption (i) in Definition 12.31 and the fact that are General algorithms, we have that, for every fixed 
n > no, |Ar„(A)| < 00 . Thus, by (ii) in Definition 12.31 the General algorithm T n only depends on a 
finite part of A, in particular {Af}feA r „(A) where Af = /(A). Since each / £ Ar„(A) represents a 
coordinate evaluation of A and by the definition of the metric dB in (114.1b . it follows that for all sufficiently 
large j, f(A) = f{Af) for all / £ Ar n (A). By assumption (iii) in Definition 12.31 it then follows that 
Ar„(-Aj) = Ar„ (A) for all sufficiently large j. Hence, by assumption (ii) in Definition 12.31 we have that 
T n (A) = r n (Aj) = £ for all sufficiently large j. Thus, T n (A) = £ for all n > no and all A £ S no . Since 
S no is not nowhere dense, we can choose a point A in the interior of S no and fix a sufficiently large initial 
segment a of A such that E T (a) is a subset of S no . The assertion of the lemma now follows. The extension 
of the proof to CI 3 is clear. □ 

Roughly speaking, this shows that there is a nice open and closed nonmeager subspace of T for which 
lim n _ >00 T„ (A) exists even in a uniform manner. Note that this result particularly applies to the case 7' = V.. 

Proof of Theorem \7.2\ Step I: SCI(S 5 ,S 72 )g > 3. We argue by contradiction and assume that there is a 
height two tower {T r }, {T r s } for H 5 , where T r denote, as usual, the pointwise limits lim^oo T r s . We 
will inductively construct initial segments {cr n } with er n +i D a n yielding an infinite matrix A D a n for 
all n £ N, such that Hindoo T r (A) does not exist. We construct {o n } with the help of two sequences of 
subsets {T n } and {S'n} of f2, with the properties that T n+ 1 C T n , each T n is closed, and either T n = Q or 
there is an initial segment o £ C mxm where m> n such that T n is the set of all extensions of a with all the 
remaining entries in the first n columns being zero. 

Suppose that we have chosen T n . Note that the subset of all matrices in T n with one particular entry 
being fixed is closed in T n , hence the set of all matrices with one particular column being fixed is closed 
(as an intersection of closed sets). The latter set has no interior points in T n , hence is nowhere dense in T n . 
This provides that the set of all matrices in T n for which a particular column has only finitely many Is is a 
countable union of nowhere dense sets in T n , hence is meager in T n . Taking intersections we find that the 
set of all matrices in T n having only finitely many Is in each of their columns is meager in T n as well. Let 
R be its complement in T n , i.e. the nonmeager set of all matrices A £ T„ with S-J A) = Yes. 

Clearly, R = (J r eN := £ R '■ Tfc(A) = Yes Vfc > r}, and there is an r n such that S n := 

R Tn is nonmeager in T n . Note that r rniS are General algorithms and r rn (A) = lim^oo T rniS (A) = Yes 
for all A £ S n . Thus, Lemma [143] applies and yields an initial segment a n , such that 

(14.2) E Tn {<j n ) A 0 and T r „ ( A ) = Yes for all A £ E Tn ( a n ). 

Now, let T n+ i C 7'„ be the (closed) set of all matrices in E Tn (a n ) with all remainingentries in the first 
n + 1 columns being zero. Letting To = fl we have completed the construction. 

The nested initial segments 1 D a n obviously yield a matrix A £ n^L 0 T n and this A has only finitely 
many Is in each of its columns. However, by the construction of {T n }, we have that A £ E T " (a,,) for all 
n £ N. Thus, 55 (A) = No, but by (114.2b . Tfe(A) = Yes for infinitely many k. 

I.e. outside the initial segment a n . 
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Step II: SCI(S 5 ,n 2 ) A < 3. Let = Yes when i ai J > 171 f° r at ^ east one 

j £ {1,..., fc}, and No otherwise. Here Ar fc m „ ({«i ,j}) consists of the evaluations f rs : {tZij}ije n h► a r , s 
with r = 1,..., n and r = 1,..., k. Further, define Tk t m({ai,j}) = Yes when YliLi a i,j > 171 f° r at l east 
one j £ {1,..., k}, and let Tfc({tZi,j}) = Yes when X^i a i,j = 00 f° r at l east one J £ {1,..., A;}. It is 
straightforwardly checked that this yields an Arithmetic tower of algorithms, hence 801(5-,, 0 2 ) a < 3, and 
thus we have shown the assertions for Eg. 

Step III: SCI(S 5 ,n 2 ) G < SCI(S 6 ,n 2 ) G and SCI(S 6 ,L! 2 )a < 4. To see the former, suppose we are 
given {dij}, then define a new matrix by bij := max{aj jS : s = 1,... ,j}. Then {a t ,j} has a column 
with infinitely many non-zero entries if and only if {dj} has infinitely many columns with infinitely many 
non-zero entries. On the other hand, there is an Arithmetic tower of algorithms of height 4 for 56. In¬ 
deed, let Yi t k,m,n{{aij}) = Yes when Xa=i a i,j > 171 f° r more than l numbers j £ {1,..., k}. Also 
define Ti t k, m {{aij}) = Yes when Yli=i a i,j > rn f° r more than l numbers j £ {l,...,fc}, as well 
as Tj ^daij}) = Yes when a i,j = 00 f° r more than l numbers j £ {1,..., k}. And finally set 

rf({ a i,j}) = Yes when a bj = 00 f° r more than l numbers j £ N. It is easy to see that this is an 

Arithmetic tower of algorithms, hence we have shown that SCI(Sg, 0 2 ) a < 4. 

Step TV: SCI(S 7 ,fl 2 ) G > 3. Indeed, one can proceed as in the proof of Step I: The set of all matrices 
with infinitely many “finite columns” is meager since it is a subset of the union of the meager sets 14 of all 
matrices with the fcth column being finite. Thus, the set {A : 5 7 (.4) = Yes} is nonmeager. 

Then exactly the same construction as above yields a sequence of closed spaces T n and a matrix A in 
their intersection which has only finitely many Is in each of its columns, but T r „ (A) = Yes for all r n of a 
certain sequence {r n }, a contradiction. 

Step V: SCI(5g, fd) G > 3. The proof is very similar to the proof of Step I. In particular, we argue by 
contradiction and assume that there is a height two tower {T r }, {L TVS } for 5§. As above, we inductively 
construct initial segments {cr n } with cr n +i D a n yielding an infinite matrix A D a n for all n £ N, such that 
linv^oo T r (A) does not exist. We construct {a n } with the help of two sequences of subsets {T„} and {S^} 
of SI, with the properties that T n+ 1 C T n , each T n is closed, and either T n = SI 3 or there is an initial segment 
a £ c( 2m + 1 ) x ( 2m + 1 ) where m> n such that T n is the set of all extensions of a with all ±nth semi-columns 
being filled by n additional Is and infinitely many Os, and and all the other fcth columns, |fc| < n — 1, are 
being filled with zeros. In particular, if {ajjhjez £ T n then 

• • j 0 ) 1) • * • ; 1 : ■ ■ ■ • 1 , . . . , 1 , 0, . . .} , 

(14.3) n times n times 

{tZ i^k}i^:7 J — d • • ) 0 ) @—ra,k-i ■ • ■ : 0 ? ■ • 5 fc £ ^+5 |fc| 5 ZZ 1 . 

Suppose that we have chosen T n . We argue as in Step I and deduce that for fc £ Z the set of all matrices 
in T n with one of the two fcth semi-columns being fixed is nowhere dense in T n , hence the set of all matrices 
in T n with (one of the two) fcth semi-columns having finitely many Is is meager in T n . We conclude that the 
set of all matrices in T n with one semi-column having finitely many Is is meager, thus its complement in T n , 
the set of all matrices with all semi-columns having infinitely many Is, is nonmeager. Therefore the same 
holds for the superset {A £ T n : 5g(A) = Yes}. Denoting this set by R we obviously have R = U r eN 
with R r := {A £ R : Yk{A) = Yes Vfc > r}, and there is an r n such that S n := R Tn is nonmeager in 
T n . Note that T rnjS are General algorithms and T rn (A) = lim^oo T rrljS (A) = Yes for all A £ S n . Thus, 
Lemma fl4.3l ar)plies and yields an initial segment a n , such that 

(14.4) E Tn (cr n ) ^ 0 and T r „ (A) = Yes for all A £ E Tn (a n ). 

Now, let T n+ i C T n be the (closed) set of all matrices {T-Aj'li.jeN i * 1 E Tn (cr ra ) with the property that (114.31) 
holds with a = a n . Letting To = fI 3 concludes the construction. The nested sequence {<j n }again defines 
a matrix A £ {Y^L 0 T n with the property that A has finitely many but at least fc non-zero entries in the each 
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of its fcth semi-column which gives (-4.) = No, but, by (1 14.4b . T t ( A) = Yes for infinitely many k, a 
contradiction. 

Step VII: SCI(S 7 , n 2 ) A < 3 and SCI(H 8 , n 3 ) A < 3. This can again be proved by defining an appropri¬ 
ate tower of height 3 directly, as was done for E-, in Step II, for example. For E 7 

n 

Yk,m,n — YeS ■w' | {) — 1, . . . , Ul . ^ ^ ^ rtt} | <C k 

i=l 

may do the job. A more elegant way is to translate and to employ the already proved results on spectra, see 
Remark fl4.4l □ 


With the lower bounds of the SCI of the decision problems S 7 and Eg established we can now get the 
lower bounds of the SCI of spectra and essential spectra of operators. 


Proof of Step III in the proof of Theorem \3.8\ in Section [7T7| First, note that without loss of generality we may 
identify fli = B(l 2 (N)) with fl = B(X), where X = ® = l 2 ( N, X n ) is the space of square-summable 

sequences of elements in X n , where X n = Z 2 (Z). For this just choose any enumeration (i.e. bijection) 
N — > N x Z to get Z 2 (N) ~ l 2 (N x Z) ~ X. Second, for the sake of simplicity in notation we identify a 
sequence {cpj-igN with its extension {af z where af := for i > 0 and a\ := 1 for all i < 0. 

Let a = {ai}i e N be a sequence with a* £ {0,1}. Define the operator B a £ B(l 2 (Z)) as follows 


ej : if af = 1 , l = maxjf : j < i, of = 1 } 
(14.5) B aei :={ 1 ^ 

I ei : otherwise, 


where {ejjj^z is the canonical basis for 1 2 {T). Hence, B a acts as a shift on the basis elements {ej : a® = 1}, 
and as the identity on all other basis elements. Clearly, B a has norm 1 and if a = {a ,} has infinitely many 
Is then B a is invertible and its inverse (the “reverse shift”, actually given by its transpose f?J) has norm 1 as 
well. On the other hand, if a has only finitely many Is then B a is Fredholm of index —1 (again just look at 
the regularizer f?J). Moreover, notice that evaluating a finite number of entries of B a requires the evaluation 
of only a finite number of «,s. Now, given a matrix { a l 3 } £ fl we take the operators lit = B{ a , fc } ieN 
arising from the fcth column of { a, 3 } via ( 1 1 4.5| i, respectively, and define the diagonal operator 

OO 

k =1 


on X. Obviously, its norm is 1. If only finitely many If,- are non-invertible (i.e. only finitely many columns 
of {ai j } do not have infinitely many nonzero entries) then the diagonal operator defined from the regularizes 
lf[ is a regularizer of norm 1 for C, hence sp ess (C) is contained in the unit circle in that case. Otherwise, if 
infinitely many Bk are non-invertible then C is not Fredholm, i.e. 0 £ sp ess (C'). Therefore we can conclude 
the following: If there was a tower {r m }, { r„ v „ } of height two for the essential spectrum (of C or its 
respective Oi-counterpart), then we would get a tower of height two for S 7 by 


(14.6) 


r m ,n{{aij}) := Yes if and only if r mj „(C') O B 1/2 ( 0) = 0, 
f := Yes if and only if T m (C) O B 1 / 2 (0) = 0, 


contradicting the fact that the SCI of S 7 is three, which is established in Theorem [73] 


□ 


Proof of Step III in the proof of Theorem \3 . 71 As above, first observe that, without loss of generality, we may 
identify fli = B(l 2 ( N)) with f l = B{X), where X = ®^E_ 00 X„ in the Z 2 -sense and where X n = l 2 (If). 
Second, we consider sequences a = {ai}j 6 z over Z with a, £ {0,1}, and define respective operators 
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B a £ £>(7 2 (Z)) with matrix representation B a = {hfc,,} by 


bk,i 


! 1 : k = i and a k = 0 

1 : k < i and a k = ai = 1 and a,j = 0 for all k < j < i 
0 : otherwise. 


Then B a is again a shift on a certain subset of basis elements and the identity on the other basis elements, 
hence we have the following possible spectra: 

• sp (B a ) C {0,1} if {di} has finitely many Is. 

• sp (B a ) = T, the unit circle, if there are infinitely many i > 0 with a,; = 1 and infinitely many i < 0 
with a* = 1 (we say {a^} is two-sided infinite). 

• sp (B a ) = D, the unit disc, if {a,} has infinitely many Is, but only finitely many for i < 0 or finitely 
many for i > 0 (we say {aj} is one-sided infinite in that case). 

Next for a matrix we define the operator 

OO 

(14.7) C ~ 0 B k 

k=— oo 


on X, where B k = B{ a . fc j iez corresponds to the column {di,fe}iez in the above sense. Concerning its 
spectrum we have Ufeez sp(-Bfc) C sp(C') C D since ||C7|| = 1. Clearly, if one of the columns is one-sided 
infinite then sp(C') = D. The same holds true if for every k € N there is a finite column with at least k Is. 
Otherwise (that is if there is a number D such that for every column it holds that it either has less than D Is 
or is two-sided infinite) the spectrum sp(C) is a subset of {0} U T. Therefore if we had a height two tower 
{t\ m }, {I'rn.ri } for the computation of the spectrum of C or its counterpart in C> | then 


(14.8) 


f ro ,„(K,}) := Yes if and only if T mi „((7) (T S 1/4 ( 1/2) = 
f ■■= Yes if and only if Y m (C) (T B 1 /A (l/2) = 0 

would provide a height two tower for Sg, contradicting Theorem 17.21 


□ 


Remark 14.4. Note that Step VII in the proof of Theorem l7.2l follows immediately from the fact that the 
SCI of spectra and essential spectra of arbitrary operators are bounded by three and by translating respective 
height three towers similarly to (114.6b and (114.81 ). 


Proof of Theorem 17.31 Step I: SCI(S,Hi)g > 2 . Note that this results follows almost directly from the 
techniques used in the proof of Step III in the proof of Theorem l3.7l above. Indeed, we may define C as in 
(114.7b . and assume that there is a height two tower {T m }, {T m n } for the computation of E. We can now 
argue exactly as done in the proof of Step III in the proof of Theorem l3.7l and deduce that we can define a 
height two tower for Eg from Section [7TTl as follows: 

f m,n({ai,j}) ■■= Yes if and only if T min (C - -I) = No 

fm({aij}) := Yes if and only if T m (C - -I) = No. 

This would of course contradict Theorem l7.2l 

Step II: SCI(H, I2i)a < 3. Let A £ B(l 2 ( N)) and define the numbers 

7 := min{ui(^4),cri(A*)} 

7 m ■■= mm{a 1 (AP rn ),a 1 (A*P m )} 

:= min{cri(P n 2 lP m ), ai(P n A*P m )} 

:= rnin{2 _m fc : k £ N, 2 ~ m k > ai(P n AP m ) or 2 ~ m k > o x {P n A*P m )}, 
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where cri(B) := inf{||i?£|| : £ £ X, ||£|| = 1} for B £ B(l 2 (N)) is the smallest singular value of B 
and where the operators of the form P n AP m are regarded as operators/matrices in £>(RanP m , RanP„), 
respectively. It is well known that A is invertible if and only if 7 > 0. Further, note that 7 m | m 7 , and that 
7 m,n tn 7 m for every fixed m. The sequences {S m/T ,} n are bounded and monotonically non-decreasing, and 
7 m,n < 1 5 m ,n < 7 m,n + 2 _m < 7 m + 2~ m . Thus, for e > 0 there is an too, and for every to > too there is 
an no = no (to) such that 

(14.9) |7 - S mi n\ < |7 - 7m| + 17rn - 7m,n| + | 7 m,n - 5m,n| < e/3 + e/3 + 2~ m < £ 

whenever to > mo and n > no(m). So we see that the numbers <5 m>n converge monotonically from below 
for every masn -> oc, and the respective limits form a non-increasing sequence w.r.t. to, tending to 7 . 
Moreover, each <5 m „ can be computed with finitely many arithmetic operations by Proriosition llO.il Thus, 
if we define Tk,m,n(A) := (S m}n < fc” 1 ), we arrive at an Arithmetic tower of algorithms of height three 

sp°(A) = lim lim lim T k mn (A) 

k —>00 m —>00 n —>00 

for our problem, hence SCI(S, 1 1 1 ) 4 < 3. 

Step III: SCI(S, 0 , 3)4 < SC1(E. 0 2 ) A < 2. If one considers operators for which a bound / on their 
dispersion is known, then choosing n = /(to) turns (114.91) into 

(14.10) |7 | — |7 7 m| T | 7 m 7m,/(m) I T | 7 m,/(m) I — e /3 4 f/3 + 2 7 £ 

for large m taking \oi(BP m ) — ui{Pf(m)BP m )\ < || (/ — P/( m ))PPm|| into account. Therefore, a natural 
first guess for General algorithms could be Tk } m(A) := (<5 m ./(m) < fc -1 )- Unfortunately, although S m j( m ) 
converges to 7 as to —> 00 by (114.101) , this is not monotone in general. Hence, it might be the case that 
7 = fc -1 , but oscillates around fc -1 such that {rfc,m(7L)}m may not converge. To overcome this 

drawback, we consider the following algorithms 

Tk, m (A) := (|{i = 1 , • • •, to : (5 ij/(i ) < fc^ 1 )}! + \{i = 1 ,..., m : (5 ij/(i) < (fc + 1) _1 )}| > m). 

This converges in any case as to —> 00 . Indeed: If 7 > fc -1 then I\ m (7l) = Afo for sufficiently large to. If 
7 < (fc + l ) -1 then Fk t m(A) = Yes for sufficiently large to. If 7 = fc -1 then |{i = 1,..., to : < 

(fc + 1) -1 )}| are the same for all sufficiently large to and it follows that I\ m (Al) converges. Analogously 
for 7 = (fc + l) -1 . Finally, if (fc + l ) -1 < 7 < fc -1 then again one of these families is uniformly bounded 
and the other only misses a finite number of points, and we again get convergence. Now, it is clear that 
SCI(S, fl 2 ) A < 2. 

Step IV: SCI(S, > SCI(E, U, 3 jc, > 2. If we assume that there is a General height-one-tower 
of algorithms {i' n } over O 3 then we can again construct counterexamples very easily: For a decreasing 
sequence { a ,} of positive numbers we consider the diagonal operator A := cliag{a, }. Clearly, 0 belongs to 
the spectrum of A if and only if the ajS tend to zero. As a start, set {a/} := {1,1,...}, choose ni such that 
r„(diag{a/}) = No for all n > n\, and i\ such that r ni (diag{aj}) does not see the diagonal entries a\ with 
indices i > i\. This is possible by (iii) in Definition l2.3l Then set {a 2 } := {1,1,..., 1,1/2,1/2,...} with 
1 / 2s starting at the iith position. Ifni,, ^fe-i and i\,..., ik-i are already chosen then pick nk such that 
r„(diag{aj : }) = No for all n > nk, and ik such that T nfe (diag{af}) doesn’t see the diagonal entries a/ with 
indices i > ik, and modify {af} to {a^ +1 } := { 1 ,..., 2 ~ k , 2 ~ k ,...} with 2 -fe s starting at the ifcth position. 
Now, the contradiction is as in the previous proofs and we see that SCI(S, f^G > SCI(S, S 23 )g >2. □ 

Proof of Theorem \7.1 3\ Step I: We show that if SCI(S, Q)g < to then H is A m +i. Letp = lim iPi. Then 

p = true 44 Vn3fc(fc>nApfc) 44 3nVfc(fc < n V pk)- 

Further, let p : N —> N x N, fc 1 —> {(pi(k),(p 2 {k)) be a bijection which enumerates all pairs of natural 
numbers, and note that 


3n3m(p nt m) 44 3fc(p(g?i(fc), <p 2 (&))), VnVTO(p njm ) 44 Vfc(p(v?i(fc), ^ 2 (fc))), 
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for any family (p n ,m)n,meN C AT Thus, every limit in a tower of height to can be converted alternately into 
an expression with two quantifiers (V3 or 3V), and then to — 1 doubles 33 or W can be replaced by single 
quantifiers. This easily gives the claim (i) of Theorem l7.13l 

Step II: We show that if S is E m or II m then SCI(S, fl)c < to. As a start let (pf) C Ad be a sequence. 
Then 


(V*(pj)) = true 



= true , 


(3 i{pi)) = true <h> 


lim 

n—foo 


V* 


= true. 


\ »=l / \ *=l / 

Furthermore, the conjunction (disjunction) of limits coincides with the limit of the elementwise conjunction 
(disjunction), hence 


km-1 


Vn 

m 3 n m— 1 • • • VniT ri 


= 1 i m fc lhn -"lim A V ■"A 


i = l i m _i = l 


ii = l 


and similarly for any other possible alternating quantifier form. Since the T„ mi ... rai in the alternating quan¬ 
tifier form at the left hand side are General algorithms, the right hand side obviously yields a tower of 
algorithms of height to. 

Step HI: Let to € N be the smallest number with S being A m +i. In the above steps we have already 
seen that to < SC1(E, 0)g < to + 1, and we next prove the following: If 


S(y) =3i\/j(g 0 (i,j,y)) = \/n3m{gi(n, to, y)) 

then £(y) = lim^oo g(k , y) with a function g being easily derivable from go, g±. The following construc¬ 
tion is adopted from ll43] Proofs of Theorems 1 and 3]. Fix y and define a function ho : N —> A4 recursively 
as follows: 

i(l):=l, j(l) := 1 , h 0 (l) := g 0 (i(l),j(l),y). 

If h 0 (l) = true 

then: i(l + 1) := i(l), j(l + 1) := j{l) + 1 

else: i(l + 1) := i(l) + 1, j(Z + l):=l. 

l:=l + 1. 

ho(l) := g 0 (i{l),j(l),y). 

We observe that, if E(y) = true then ho(l) converges as l —> oo with limit true. Otherwise, the limit does 
not exist or is false. The same construction applies to -i(Vn 3 TO(< 7 i(n, to, y))) = m , y)) and 

yields a function hi which converges to true if and only if S (y) = false. Clearly, exactly one of the functions 
ho, hi converges to true. Now we derive the desired g from ho and hi as follows: 
a(l) = 0. 

If h a (k){k) = true 
then : a{k + 1) := a(k) 
else : a(k + 1) := 1 — a(k). 
k := k + 1. 

If a(k) = 0 

then: g(k,y) := true 

else: g{k,y) := false. 

This provides E(y) = g (k, y). 

Next, let g 0 and gi be of the form g s {i,j, y) = lim r ffj r {y), s € {0,1}. Fix y. Then for every pair (i,j) 
there is an r(i,j) such that /* v r (y) = g s {u, v, y) for all u < i, v < j, s € {0,1} and r > r(i,j). Thus, 
g is also of the form g(k, y ) = lim r fk,r{y) with fk, r being defined by the above procedure applied to the 
functions ( i,j , y) >-)■ ff ijtk {y) instead of g a (i,j, y) (s G {0,1}). 

Now we are left with iterating this argument: If both functions g s (s £ {0,1}) are of the form g s (i,j, y) = 
lim*™-! lim fem _ 2 • • • lim fcl fl j km li ... M (y) with certain General algorithms fl j km _ lr .. >kl , then also g is 
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of the form 

g{k, y) = lim lim ■lim/ fcifcm _ 1> ... ifcl (y) 

Km — 1 Km — 2 ^1 

with fk,k m -i,— ,ki being defined by the same procedure as before applied to the functions ( i,j,y ) 
fi,j,k m - i,...,fci(y) instead of g s {i,j,y) (s £ {0,1}). The resulting functions y fk,k m - x ,- M (v) are 
General algorithms for every k, since their evaluation requires only finitely many evaluations of the General 
algorithms□ 
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